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DNA DEMETHYLASE , THERAPEUTIC AND 
DIAGNOSTIC USES THEREOF 

BACKGROUND OF THE INVENTION 

5 (a) Field of the Invention 

The invention relates to a novel enzyme, DNA 
demethylase, therapeutic and diagnostic uses thereof, 
(b) Description of Prior Art 

Many lines of evidence have established that 

10 modification of cytosine moieties residing in the dinu- 
cleotide sequence CpG in vertebrate genomes is involved 
in regulating a number of genome functions such as 
parental imprinting, X-inactivat ion, suppression of 
methylation of ectopic genes and differential gene 

15 expression (Szyf, M. (1996) Pharmacol. Ther. 70, 1-37). 
DNA methylation performs its function of differentially 
marking genes because the distribution of methylated 
CpGs is tissue- and site- specific forming a pattern of 
methylation (Szyf, M. (1996), Pharmacol. Ther. 70, 1- 

20 37) . It is clear that the pattern of methylation is 
fashioned by a sequence of methylation and demethyla- 
tion events (Brandeis, M. et al . (1993) Bioassays 15, 
709-713) during development and is maintained in the 
fully differentiated cell (Razin, A. et al. (1980) Sci- 

25 ence 210, 604-610) . While it was originally suggested 
that DNA demethylation is accomplished by a passive 
loss of methyl groups during replication (Razin, A. et 
al. (1980) Science 210, 604-610), it is now clear that 
an active process of demethylation occurs in embryonal 

30 cells (Frank, D. et al . (1991) Nature 351, 239-241), in 
differentiating cell lines (Razin, A. et al . (1986) 
Proc. Natl. Acad. Sci. USA 83, 2827-2831; Szyf, M. et 
al. (1985) Proc. Natl. Acad. Sci. USA 82, 8090-8094) 
and in response to estrogen treatment (Saluz, H.P. et 

35 al. (1986) Proc. Natl. Acad. Sci. USA 83, 7167-7171). 
Two modes of demethylation have been documented: site 
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specific demethylation that coincides in many instances 
with onset of gene expression of specific genes and a 
general genome wide demethylation that occurs during 
early development in vivo during cellular differentia- 
5 tion and in cancer cells (Feinberg, A. P. et al . (1983) 
Nature 301, 89-92; Razin, A. et al . (1986) Proc. Natl. 
Acad. Sci. USA 83, 2827-2831). The global demethyla- 
tion is consistent with the hypothesis that a general 
demethylase activity which is activated at specific 

10 points in development or oncogenesis exists. It has 
been hypothesized that one mechanism regulating the 
pattern of methylation * is the control of expression of 
methyltransferase (Szyf, M. (1991) Biochem. Cell Biol. 
69, 764-767) and demethylase activities (Szyf, M. (1994) 

15 Trends Pharmacol. Sci. 7, 233-238). Although exten- 
sive information has been obtained on the enzymatic 
activity responsible for methylation and the regulation 
of its expression in the last two decades (Szyf, M. 
(1996) Pharmacol. Ther. 70, 1-37), the identity of the 

20 demethylase has remained a mystery. It is clear how- 
ever that to fully understand how patterns of methyla- 
tion are formed and maintained and to determine their 
role in development, physiology and oncogenesis, one 
has to identify the demethylase enzyme (s). Two main 

25 difficulties have inhibited the identification of this 
enzyme. First, it is believed that demethylation of a 
methylated cytosine is chemically highly unlikely since 
it involves breaking a very stable C-C bond. Second, 
demethylation occurs at very defined stages in develop- 

30 ment (Brandeis, M. et al . (1993) Bioassays 15, 709-713) 
and identifying an adequate tissue source for this 
enzyme is critical. 

Whereas no bona fide demethylase has been iden- 
tified to date, alternative biochemical mechanisms 

3 5 involving exchange of methylated cytosines with non- 
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methylated cytosines have been described. One previ- 
ously proposed mechanism is removal of the methylated 
base by a glycosylase and its replacement with a non- 
methylated nucleotide utilizing an M excision-repair " 
5 mechanism (Razin, A. et al . (1986) Proc. Natl. Acad. 
Sci. USA 83, 2827-2831). Glycosylase activities that 
can remove methylated cytosines from DNA have been dem- 
onstrated by Vairapandi and Duker (Vairapandi, M. et 
al. (1993) Nucl. Acids Res. 21, 5323-5327) and more 

10 recently by Jost (Jost, J. P. et al . (1995) J. Biol. 
Chem. 270, 9734-9739) . However it is not clear 

whether this activity is responsible for the general 
demethylation observed in cellular differentiation. 
The fact that the activity identified by Jost acts spe- 

15 cifically on hemimethylated sequences (which is not the 
natural substrate in most cases) and can remove thymi- 
dines as well as 5-methylcytosines, supports a repair 
function for this glycosylase-demethylase (Jost, J. P. 
et al. (1995) J. Biol. Chem. 270, 9734-9739). An 

20 alternative mechanism involving a RNA dependent activ- 
ity has been recently described by Weiss et al . (Weiss 
et al . , 1996). This proteinase-insensitive RNA depend- 
ent activity has been shown to catalyze the excision 
and replacement of a methylated CpG dinucleotide with a 

2 5 nonmethylated CpG dinucleotide that is contained in a 

DNA -RNA hybrid molecule (Weiss, A. et al . (1996) Cell 
87, 709-718) . This activity which was identified in 
differentiating cells in culture was proposed to be 
involved in demethylation during development. These 

3 0 previous findings demonstrate that the common accepted 

model in the filed has been that a bona fide demethy- 
lase does not exist. 

It has been previously proposed that the exten- 
sive hypomethylation observed in cancer cells might be 
35 a consequence of activation of demethylase activity by 
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oncogenic pathways (Szyf, M. (1994) Trends Pharmacol. 
Sci. 7, 233-238/ Szyf, M . et al . (1995) J. Biol. Chem. 
270, 12690-12696) . In accordance with this hypothesis 
we have shown that ectopic expression of v-Ha-ras had 
5 induced demethylation activity in the cells (Szyf, M. 
et al. (1995) J. Biol. Chem. 270, 12690-12696). Using 
an assay that directly measures the conversion of 3' 32 P 
labeled methyl dCMP (mdCMP) into dCMP, we have shown 
that nuclear extracts prepared from P19-Ras transfec- 

10 tants bear high levels of demethylase activity (Szyf, 
M. et al. (1995) J. Biol. Chem. 270, 12690-12696). 
Building on this observation, we hypothesized that can- 
cer cell lines were a good source for demethylase. 
However, it is not evident that Ras expression in pl9 

15 cells does reflect the situation in cancer cells. P19 
is an embryonic cell and expression of Ras might be 
differentiating them. 

It would be highly desirable to be provided with 
a bona fide DNA demethylase (DNA dMTase) to alter 

20 developmental programs for therapeutic and biological 
use . 

SUMMARY OF THE INVENTION 

In accordance with the present invention, we 
25 demonstrate the purification of a bona fide DNA demeth- 
ylase (DNA dMTase) from a human lung cancer cell line 
A549, determine its kinetic parameters and substrate 
specificity. The DNA dMTase activity identified in 
this study converts methyl -dCMP (mdCMP) residing in the 
30 dinucleotide sequence mdCpG into dCMP whereas the 
methyl group is released as a volatile residue which 
was identified to be methanol. The activity is puri- 
fied away from any trace amounts of dCTP, is insensi- 
tive to the DNA polymerase inhibitor ddCTP, is not 
35 affected by the presence of methyl dCTP (mdCTP) in the 
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reaction and does not exhibit exonuclease or glyco- 
sylase activities. The identification of this new 
enzyme points out to new directions in our understand- 
ing of how DNA methylation patterns are formed and 
5 altered. 

One aim of the present invention is to provide a 
bona fide DNA demethylase (DNA dMTase) . 

In accordance with the present invention there 
is provided a DNA demethylase enzyme having about 
10 40 KDa, and wherein the DNA demethylase enzyme is over- 
expressed in cancer cells and not in normal cells. 

In accordance with the present invention there 
is provided a cDNA encoding human demethylase which 
comprises a sequence set forth in SEQ ID NO:l. 
15 In accordance with the present invention there 

is provided two mouse cDNAs homologous to the human 
cDNA, wherein the cDNA encoding mouse demethylase hav- 
ing a sequence set forth in SEQ ID NOS:5-7. 

In accordance with the present invention there 
20 is provided a different human cDNA which encodes a pro- 
tein homologous to the human demethylase having a 
sequence set forth in SEQ ID NO:3. 

In accordance with the present invention there 
is provided the use of the expression of demethylase 
25 cDNAs to alter DNA methylation patterns of DNA in vitro 
in cells or in vivo in humans, animals and in plants. 

The demethylase cDNAs expression may be under 
the direction of mammalian promoters, such as CMV. 

The demethylase cDNAs expression may be under 
30 plant specific promoters to alter methylation in plants 
and to allow for altering states of development of 
plants and expression of foreign genes in plants. 

The demethylase cDNAs expression may be in the 
antisense orientation to inhibit demethylase in cancer 
35 cells for therapeutic processes. 
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The expression of demethylase cDNA in mammalian 
cells may be to alter their differentiation state and 
to generate stem cells for therapeutics, cells for ani- 
mal cloning and to improve expression of foreign genes. 
5 In accordance with the present invention there 

is provided the use of the expression of demethylase 
cDNAs in bacterial or insect cells for production of 
large amounts of demethylase. 

In accordance with the present invention there 
10 is provided the use of the expression of demethylase 
cDNAs for the production of protein in vertebrate, 
insect or bacterial or plant cells, such as antibodies 
against demethylase . 

In accordance with the present invention there 
15 is provided the use of the sequence of demethylase 
cDNAs as a template to design antisense oligonucleo- 
tides and ribozymes. 

In accordance with the present invention there 
is provided the use of the predicted peptide sequence 

2 0 of demethylase cDNAs to produce polyclonal or mono- 

clonal antibodies against demethylase . 

In accordance with the present invention there 
is provided the use of expression of cDNAs in two 
hybrid systems in yeast to identify proteins interact- 
25 ing with demethylase for diagnostic and therapeutic 
purposes . 

In accordance with the present invention there 
is provided the use of expression of cDNAs in bacte- 
rial, vertebrate or insect cells to produce large 

3 0 amounts of demethylase for obtaining a x-ray crystal 

structure and for high throughput screening of demethy- 
lase inhibitors for therapeutics and biotechnology. 

In accordance with the present invention there 
is provided a volatile assay for high throughput 
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screening of demethylase inhibitors as therapeutics and 
anticancer agents which comprises the steps of: 

a) using transcribed and translated demethylase 
cDNAs in vitro to convert methyl -cytosine pres- 

5 ent in methylated DNA samples to cytosine pres- 

ent in DNA and volatilize methyl group; 

b) determining the absence or minute amount of 
volatilize methyl group as an indication of an 
active demethylase inhibitor. 

10 In accordance with the present invention there 

is provided a volatile assay for the diagnostics of 
cancer in a patient sample which comprises the steps 
of: 

a) determining demethylase activity in patient sam- 
15 pies by assaying conversion of methyl -cytosine 

present in methylated DNA to cytosine present in 
DNA and its volatilization as methyl groups 
released as methanol; 

b) determining the presence or minute amount of 
2 0 volatilized methyl released as methanol groups 

as an indication of cancer in the patient sam- 
ple . 

In accordance with the present invention there 

is provided the use of an antagonist or inhibitor of 

25 DNA demethylase for the manufacture of a medicament for 

cancer treatment, for restoring an aberrant methylation 

pattern in a patient DNA r or for changing a methylation 

pattern in a patient DNA. 

Such an antagonist is a double stranded oligonu- 

30 cleotide that inhibits demethylase at a Ki of 50nM, 

such as fc m GC m GC m GC m G] . 

[G m CG m CG m CG m cJ n 

The inhibitors include, without limitation an 

ant i -DNA demethylase antibody, an antisense of DNA 

35 demethylase or a small molecule such as any derivative 

of imidazole . 
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The change of the methylation pattern may acti- 
vate a silent gene. Such an activation of a silent gene 
permits the correction of genetic defect such as found 
for P-thalassemia or sickle cell anemia. 
5 The DNA demethylase of the present invention may 

be used to remove methyl groups on DNA in vitro such as 
needed for cloning DNA . 

The DNA demethylase of the present invention or 
its cDNAs may be used, for changing the state of dif- 

10 ferentiation of a cell to allow gene therapy, stem cell 
selection or cell cloning. 

The DNA demethylase of the present invention or 
its cDNAs may be used, for inhibiting methylation in 
cancer cells using vector mediated gene therapy. 

15 In accordance with the present invention there 

is provided an assay for the diagnostic of cancer in a 
patient, which comprises determining the level of 
expression of DNA demethylase by either RT-PCT, ELISA 
or volatilization assay of the present invention in a 

20 sample from the patient, wherein overexpression of the 
DNA demethylase is indicative of cancer cells. 

BRIEF DESCRIPTION OF THE DRAWINGS 

Figs. 1A to IB illustrate the purification of 
25 demethylase (DNA dMTase) from human A549 cells; 

Figs. 2A and 2C illustrate that DNA dMTase is a 
protein inhibited by RNA and not by ddCTP, mdCTP; 

Figs. 2B and 2D illustrate the kinetics of DNA 
dMTase activity; 
30 Figs. 3A to 3C illustrate the product of DNA 

dMTase activity is cytosine and it exhibits no exonu- 
clease or glycosylase activity; 

Figs. 4A-4C illustrate the demethylation reac- 
tion releases methanol as a volatile residue; 
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Fig. 4D illustrates the transfer of a proton 
from water to regenerate cytosine; 

Figs. 4E-4F illustrate that the volatile product 
is methanol; 

5 Fig. 5 illustrates the suggested demethylation 

reaction; 

Figs. 6A-6D illustrate the substrate Specificity 
of DNA dMTase; 

Figs. 7A-7D illustrate chromatographic isolation 
10 of dMTase from human A54 9 cells; 

Figs. 8A-8B illustrate the alignment between the 
MDB domain of MeCP2 and demethylase and the predicted 
amino acid sequence of human demethylase; 

Fig. 8C illustrates the mRNA encoded by demethy- 

15 lase; 

Figs. 9A-9F illustrate the cDNA and their pre- 
dicted amino acid of demethylases and homologues of the 
present invention (SEQ ID NOS:l-8); 

Figs. 10A-B illustrate a mammalian expression 
20 vector of dMTase and in vitro translated dMTase poly- 
peptide ; 

Fig. 10C illustrates that in vitro translated 
DNA dMTase releases volatile methyl residues from meth- 
ylated DNA; 

25 Fig. 10D illustrates that in vitro translated 

DNA dMTase transform methylated cytosines to cytosines; 

Fig. 11A illustrates that transiently trans- 
fected demethylase releases volatile residues from 
methylated DNA; 

30 Fig. 11B illustrates the polypeptide expressed 

from transiently transfected demethylase; 

Figs. 11C-11E illustrate that transiently trans- 
fected demethylase transforms methylated cytosines to 
cytosines in a protein dependent manner; 
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Fig. 11F illustrates that the transformation of 
methylated cytosine to cytosine by transiently trans - 
fected demethylase depends on the concentration of sub- 
strate; 

5 Fig. 12A illustrates that transiently trans - 

fected demethylase catalyzes the transfer of a proton 
from tritiated water to regenerate cytosine; 

Fig. 12B illustrates that the cloned demethylase 
releases methanol from methylated DNA; 
10 Figs. 13A-13C illustrate that the cancer cells 

express demethylase activity whereas normal cells do 
not ; 

Fig. 13D illustrates that demethylase mRNA is 
highly express in cancer cells; 
15 Fig. 14A illustrates demethylase bacterial ret- 

roviral and mammalian expression vector; 

Fig. 14B illustrates inhibition of demethylase 
activity by a specific inhibitor; 

Fig. 14C illustrates inhibition of tumorigenesis 
20 in vitro by an inhibition of demethylase; 

Fig. 15 illustrates inhibition of tumorigenesis 
in cell culture by induced expression of demethylase 
antisense vector; 

Fig. 16 illustrates the inhibition of demethy- 
2 5 lase by a small molecule inhibitor imidazole; and 

Fig. 17 illustrates a model for the inhibition 
of cancer growth by an inhibition of demethylase. 

DETAILED DESCRIPTION OF THE INVENTION 

30 The pattern of methylation is fashioned during 

development by a sequence of methylation and demethyla- 
tion events. The identity of the demethylase has 
remained a mystery and alternative biochemical activi- 
ties have been shown to demethylate DNA but no activity 

35 that can truly remove methyl groups from DNA has been 
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shown to date. Utilizing human lung carcinoma cells as 
a source for demethylase activity we demonstrate that 
mammalian cells bear a bona fide DNA demethylase (DNA 
dMTase) activity. DNA dMTase transforms methyl-C to C 

5 by catalyzing replacement of the methyl group on the 5 
position of C with a hydrogen derived from water. DNA 
dMTase demethylates both fully methylated and hemimeth- 
ylated DNA, shows dinucleotide specificity and can 
demethylate mdCpdG sites in different sequence con- 

0 texts. This enzyme is different from previously 
described demethylation activities: it is proteinase 
sensitive, activated by RNase and releases different 
products . 

DNA dMTase is a novel enzyme showing a new and 

5 unexpected activity that has not been previously 
described in any organism. The finding of a bona fide 
demethylase, points out new directions in our under- 
standing of the biological role of DNA methylation. 

In spite of the fact that it was previously 

0 shown that Ras expression in pl9 cells can induce 
demethylation activity. It was not clear whether this 
demethylation activity is indeed a bona fide demethy- 
lase. One would predict that demethylase is present in 
embryonal cells. It was surprising to see that demeth- 

5 ylation activity is present in cancer cells. The find- 
ing of high levels of demethylase in A549 cells is 
indeed an unexpected discovery. 

In accordance with the present invention, it is 
shown and demonstrated that demethylation occurs by 

0 removal of a methyl group from methylated cytosine in 
DNA, that a hydrogen from water replaces the methyl 
group at the 5' position, that the resulting methyl 
group reacts with the remaining hydroxyl from water to 
generate methanol which volatilizes (Fig. 4E-F) . Thus, 
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bona fide demethylation of DNA involves the following 
reaction: 

CH 3 -cytosine- (DNA) +H-OH demeth V lase -> H-cytosine + CH 3 -OH 

5 The cDNA cloned in accordance with the present 

invention is the demethylase since it can convert 
methyl -cytosines in DNA to cytosines and volatilize the 
methyl groups on DNA when transcribed and translated in 
vitro which are released as methanol. This is a novel 
10 cDNA encoding a biochemical activity that has been not 
described before. 

In accordance with the present invention, there is 
shown a model for the inhibition of cancer growth by an 
inhibition of demethylase (Fig. 17) . 

15 

EXPERIMENTAL PROCEDURES 
Cell Culture 

A549 Lung Carcinoma cells (ATCC: CCL 185) were 
grown in Dulbecco's modified Eagle's medium (with low 

20 glucose) supplemented with 10% fetal calf serum, 2 mM 
glutamine, 10 U/ml cif rof loxacin. Human Skin Fibro- 
blasts #72-213A MRHF were obtained from BioWhittaker , 
Bethesda and were grown in Dulbecco's modified Eagle's 
medium supplement with 2% fetal calf serum, 2 mM gluta- 

25 mine. H446 Lung carcinoma cells (ATCC: HTB 171) was 
grown in RPMI 1640 medium with 5% fetal calf serum. 
Preparation of nuclear extract 

Nuclear extracts were prepared from A54 9 cul- 
tures at near confluence as previously described (Szyf 

30 et al., 1991; Szyf et al.,1995). The cells were tryp- 
sinized, collected and washed with phosphate-buffered 
saline and suspended in buffer A (10 mM Tris, pH 8.0, 
1.5 mM MgCl 2/ 5mM KCl, 0.5% NP-40) at the concentration 
of 10 s cells per ml for 10 min. at 4°C. Nuclei were 

35 collected by centrif ugation of the suspension at 1000 g 



WO 99/24583 PCT/CA98/01059 

- 13 - 



for 10 minutes. The nuclear pellet was resuspended in 
buffer A (400 /il) and collected as described in the 
experimental procedures. A nuclear extract was pre- 
pared from the pelleted nuclei by suspending them in 
5 buffer B (20 mM Tris, pH 8.0, 25% glycerol, 0.2 mM EDTA 
and 0.4 mM NaCl) at the concentration of 3.3xl0 8 nuclei 
per ml and incubating the suspension for 15 min. at 
4°C. The nuclear extract was separated from the 
nuclear pellet by centrif ugation at 10,000g for 30 min- 

10 utes. Nuclear extract were stored in -80 °C for at least 
two months without loss of activity. 
Chromatography on DEAE-Sephadex 

A freshly prepared nuclear extract (1 ml , 1.1 
mg) was passed through a Microcon™ 10 0 spin column, the 

15 retainant was diluted to a conductivity equivalent to 
0.2 M NaCl in buffer L and applied onto a DEAE-Sephadex 
column (Pharmacia) (1.0 x 5 cm) that was preequili- 
brated with buffer L (10 mM Tris-HCl, pH 7.5, 10 mM 
MgCl 2 ) containing 0.2 M NaCl at. a flow rate of 1 

2 0 ml/min. The column was then washed with 15 ml of the 
starting buffer (buffer L +■ 0.2 M NaCl) and proteins 
were eluted with 5 ml of a linear gradient of NaCl 
(0.2-5.0 M) . 0.8 ml fractions were collected and 
assayed for demethylase activity after desalting 

25 through a Microcon™ 10 spin column (Amicon) and resus- 
pension of the retainant in 0.8 ml buffer L. DNA 
demethylase eluted between 2-5.0 M NaCl. 
Chromatography on S-Sepharose 

Active DEAE-Sepharose column fractions were 

30 pooled, adjusted to 0.1 M NaCl by dilution and loaded 
onto an S-sepharose column (Pharmacia) (1.0 x5 cm) 
which had been preequilibrated with buffer L containing 
0.2 M NaCl at a flow rate of 1 ml/min. Following wash- 
ing of the column as described in experimental proce- 

35 dures, the proteins were eluted with 5 ml of a linear 
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NaCl gradient (0.2-5.0M). 0.5 ml fractions were col- 
lected and assayed for DNA demethylase activity after 
desalting and concentrating to 0.2 ml using a Microcon™ 
10 spin column. DNA demethylase activity eluted around 
5 5.0 M NaCl. 

Chromatography on Q-Sepharose 

Active fractions from S-sepharose column were 
pooled, adjusted to 0.2 M NaCl by dilution and applied 
onto a Q-sepharose (Pharmacia) column (1.0 x5 cm) which 

10 had been equilibrated as described in the experimental 
procedures at a flow rate of 1 ml/min. The column was 
washed and the proteins were eluted with a linear NaCl 
gradient (0.2- 5.0 M) . Fractions (0.5 ml) were col- 
lected, assayed for demethylase activity after desalt - 

15 ing and concentrating to a final volume of 0.2 ml as 
described in the experimental procedures. The demethy- 
lase activity eluted around 4.8-5.0 M NaCl. 
Gel-Exclusion Chromatography on DEAE-Sephacel 

The pooled fractions of Q-sepharose column were 

20 adjusted to 0.2 M NaCl, loaded onto a 2.0 x 2.0 cm 
DEAE-Sephacel column (Pharmacia) and eluted with 10 ml 
of buffer L containing 0.2 M NaCl. The fractions (0.8 
ml) were collected and assayed after concentration to 
about 18 0 ill with a Microcon™ 10 spin column for DNA 
.25 demethylase activity. The activity was detected at 
fraction 4, which is very near the void volume 
(~200kDa) . 

Assay of DNA demethylase activity 

To directly assay DNA demethylase activity in 
3 0 vitro two independent methods were applied. 

(A) To assay the conversion of methyl -dCMP (mdCMP) to 
dCMP we used a previously described method (Szyf et 
al., 1995). Briefly, a 32 P labeled, fully methylated 
poly [mdC 32 PdG] n substrate was prepared as follows. One 
35 hundred ng of a double-stranded fully methylated 



WO 99/24583 PCT/CA98/01059 



(mdCpdG) oligomer (Pharmacia) were denatured by boil- 
ing, which was followed by partial annealing at room 
temperature. The complementary strand was extended 
with Klenow fragment (Boehringer Mannheim) using 
5 methyl-5-dCTP (mdCTP, 0.1 mM) (Boehringer Mannheim) and 
[a- 32 P] GTP (100 /iCi, 3000 Ci/mmol) , and the unincorpo- 
rated nucleotides were removed by chromatography 
' through a NAP-5 column (Pharmacia) . The NAP-5 chroma- 
tography was repeated to exclude minor contamination 

10 with unincorporated nucleotides. As a control a non- 
methylated poly [dC 32 pdG] n substrate was similarly pre- 
pared except that a nonmethylated dCpdG oligomer served 
as a template and dCTP was used in the extension reac- 
tion. The column fractions (30 /il) , described in the 

15 experimental procedures were incubated with 1 ng of 
poly [mdC 32 pdG] n substrate for 1 hour at 37 °C in a 
buffer L containing 25% glycerol (v/v) and 5 mM EDTA. 
The reacted DNA as well as a nonmethylated 
poly [dC 32 pdG] n and methylated [mdC 32 pdG]n nonreacted con- 

2 0 trols were purified by phenol/chloroform extraction and 
subjected to micrococcal nuclease digestion (100 (xg at 
10 /il) and calf spleen phosphodiesterase (2/zg) 
(Boehringer) (Pharmacia) to 3' mononucleotides for 15 
hours at 3 7°C. The digestion products were loaded onto 

25 a thin layer chromatography plate (TLC) (Kodak, 13255 
Cellulose) , separated in a medium containing, 132 ml 
Isobutyric acid: 40 ml water: 4 ml ammonia solution, 
autoradiographed and the intensity of the different 
spots was determined using a phosphorimager (Fuji, BAS 

30 2000) . 32 P labeled substrates and tritium labeled sub- 
strates were phosphoimaged using BAS 2000 plate and 
BAS-TR2 04 0 phosphorimager plate respectively. 
(B) The second method determined removal of methylated 
residues from methylated DNA by measuring disappearance 

35 of 3 H-CH 3 or 14 C-CH 3 from the reaction mixture. 100 ng 
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of poly [dCdG]n double stranded DNA was methylated 
using SssI methylase (New England Biolabs) and an 
excess of [ 3 H-methyl AdoMet (80 Ci/mmol; New England 
Nuclear) ] . The tritiated methyl group containing DNA 
5 was purified from labeled AdoMet using NAP-5 column 
chromatography. All column purified fractions of DNA 
demethylase were assayed using the tritiated substrate. 
In a typical assay, 1 ng of DNA was incubated (at a 
specific activity of 4 xl0 6 dpm/mg) with 30 fil of column 

10 fraction for one hour at 37 °C in buffer L. To deter- 
mine the number of methyl groups remaining in the DNA 
following incubation with the different fractions, 250 
fil of water were added and the mixture was incubated at 
65 °C for 5 minutes. One hundred fil of the reaction 

15 mixture were withdrawn for liquid scintillation count- 
ing. Controls received similar treatment except that 
in place of a column fraction, an equal volume of 
buffer L was added. The number of methyl groups that 
were removed from the DNA by the different fractions 

2 0 was determined by subtracting the remaining counts in 

each of the fractions from the counts remaining in the 
control. All tests were carried out in triplicates. 
The results are presented as picomole methyl group 
removed. One unit of DNA dMTase activity is defined 
25 as: amount of enzyme that releases one picomole of 
methyl group from methylated dCpdG substrate in one 
hour at 37 °C. 

Methyl removal assay using double- labeled substrates 

To determine whether the methyl group leaves the 

3 0 DNA and not any non-specific removal of tritium, we 

prepared SK plasmid DNA containing a tritiated hydrogen 
at the 6' position of cytosine and thymidine by growing 
the plasmid harboring bacteria in the presence of deoxy 
[6- 3 H] Uridine (22 Ci/mmol; Amersham) (lOptCi/ml) . The 
35 [6- 3 H] -cytosine containing pBluescript SK( + ) was puri- 
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fied according to standard protocols and was methylated 
using an excess of [ 14 C-methyl] AdoMet (59 mCi/mmol; 
Amersham) (10 /iCi per 100 fil reaction) and SssI methy- 
lase. The double labeled DNA substrate was purified 
5 twice on a NAP-5 column. 15 fil of DNA dMTase were 
incubated with 1 ng- of double labeled DNA (specific 
activity of 2000 dpm/ng) for 1 hour at 37°C. Follow- 
ing incubation, the remaining 14 C versus 3 H counts were 
determined as described in the experimental procedures 

10 by scintillation counting (Wallac) . The 14 C counts were 
normalized against 3 H counts. The controls received 
similar treatment except that instead of DNA dMTase, an 
equal amount of distilled water was added to them. 

To determine the number of 3 H-CH 3 in the gaseous 

15 phase, 1 ng of 3 H-CH 3 poly [dCpdG] DNA were incubated 
with DNA dMTase overnight in a sealed tube (Pierce, 
Illinois, USA). 0.8 ml of air were removed from the 
tube using a gas tight syringe (Hamilton, Reno, Nevada) 
and injected into a sealed gas tight scintillation vial 

20 containing 10 ml OptiPhase scintillation fluid (Wallac, 
UK) and counted. As a control the DNA was incubated 
with an equal volume of buffer L and treated similarly. 
Synthesis of other methylated dC dinucleotides 

Poly [mdC 32 pdA] and [mdC 32 pdT] substrates were 

25 prepared as follows. About 0.5 M9 of 20 mer oligonu- 
cleotides 5'(GG)103', 5'(GT)103' and 5'(GA)103' were 
boiled and annealed at room temperature with oligonu- 
cleotide 5'CCCCCC3', 5'CACACA3' and 5'CTCTCT3' respec- 
tively. The complementary strand was extended with 

3 0 Klenow fragment using m5dCTP (Boehringer Mannheim) and 
either [a 32 P] dATP (100/iCi, 3000Ci/mmol) or [a 32 P] dTTP 
(100 /zCi, 3000 Ci/mmol) respectively. The unincorpo- 
rated nucleotides were removed by chromatography 
through a NAP-5 column. Hemimethylated mdCpG substrate 

35 was prepared in a similar manner except that a nonmeth- 



WO 99/24583 



- 18 - 



PCT/CA98/01059 



ylated poly dCpdG substrate (Boehringer) was used as 
template and mSdCTP and [<x 32 P] dGTP were used for exten- 
sion as described in the experimental procedures. 
Assay for nuclease and glycosylase activity 
5 [ 32 pmdCpdG]n substrate which included a labeled 

32 P 5 ' to mdC was prepared as follows. About 100 ng of 
poly dCpdG DNA were boiled and partially annealed at 
room temperature. [a 32 P] dCTP and cold dGTP were used for 
complementary strand extension as described in the 

10 experimental procedures. The free nucleotides were 
separated using NAP-5 column chromatography. The puri- 
fied [ 32 pmdCpdG]n DNA was subjected to methylation by 
SssI methylase using 320 /iM AdoMet. The DNA was repuri- 
1 fied twice using a NAP-5 column. The methylated DNA (1 

15 ng) was incubated with either 30 /zl DNA dMTase, nuclear 
extract or buffer L. To determine whether a 32 P labeled 
residue is excised from the DNA it was directly applied 
(3/il) onto a TLC plate. To determine whether the DNA 
was demethylated it was subjected to digestion with 

20 snake venom phosphodiesterase (0.2 mg in a 10/il reac- 
tion volume) (Boehringer Mannheim) which attacks the 
3' -OH group releasing 5 ' -mononucleotides . The result- 
ing mononucleotides were separated on TLC plates and 
autoradiographed . 

25 To test whether dCTP copurifies with DNA dMTase, 

which may be involved in activities other than bona 
fide demethylation, 20 /iM of dCTP with 1 /il of ct 32 P 
labeled dCTP (3000 Ci/mmole) was loaded onto the column 
with nuclear extract. The 32 P counts were measured in 

30 the flow through, washes and in the different frac- 
tions. About 1.1 million counts were loaded onto the 
DEAE-Sepharose column and were all recovered up to 
fraction 8. 

To determine whether DNA dMTase contains a DNA 
35 polymerase activity, DNA demethylase reactions were 
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performed in presence of 500 /zM of ddCTP (Pharmacia) or 
500 }M of mSdCTP (Boehringer Mannheim) at initial rate 
conditions . 

To determine whether DNA dMTase is sensitive to 
5 RNase or Proteinase K treatment, DNA dMTase was pre- 
treated for 1 h at 56 °C with 200 /ag/ml proteinase K 
(Sigma) . A demethylation reaction was carried out with 
this pretreated fraction in the usual manner using both 
demethylation assays described in the experimental pro- 

10 cedures. To test the effect of RNA digestion on the 
demethylation reaction, the fractions from different 
columns were treated with 100 /xg/ml RNase A (Sigma) . 
Demethylation of pBluescript SK(+) Plasmid 

About 4 /xg plasmid pBluescript SK (Stratagene) 

15 was subjected to methylation using SssI methylase. The 
methylated plasmid (4 ng) was incubated with 30 /il of 
DNA dMTase Fraction 4 of DEAE-Sephacel column under 
standard conditions,, extracted with phenol: chloroform 
and precipitated with ethanol . About 1 ng of the plas- 

20 mid were subjected to digestion with 10 units each of 
either of the restriction endonucleases EcoRII (GIBCO- 
BRL) , Dpnl, Hhal or Hpall (New England Biolabs) before 
and after methylation as well as after DNA dMTase 
treatment in a reaction volume of 10 /il for 2 hour at 

25 37 °C. Following restriction digestion the plasmids 
were extracted with phenol : chloroform, ethanol precipi- 
tated and resuspended in 10 /xl . The plasmids were 
electrophoresed on a 0.8% (w/w) Agarose gel, trans- 
ferred onto a Hybond Nylon membrane and hybridized with 

30 pBluescript SK(+) plasmid which was 32 P labeled by ran- 
dom-priming {Boehringer Mannheim) . 

Effect of Redox Reagents (NAD, NADH, NADP, NADPH and 
FeCl 3 ) on demethylase activity 

The reagents were prepared at 100 /zM concentra- 

35 tion and added at a final concentration of 10 /iM to a 

standard methyl removal assay under initial rate condi- 
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tions as described in the experimental procedures. The 
methyl removal activity in presence of each of the 
cof actors was compared to a control DNA dMTase reac- 
tion. 

5 Determination of kinetic parameters 

For determination of kinetic parameters, the 
demethylation reactions were performed using both 
assays (generation of dCMP and removal of methyl) as 
described in the experimental procedures except that 

10 varying DNA concentrations from 0.1 nM to 2.5 nM were 
used in a total volume of 50^1 including 30 fil of DNA 
dMTase. Since it has been established by previous 
experiments that the reaction proceeds for at least 3 
hours, the initial velocity of reaction was measured 

15 at one hour intervals. The velocity data was collected 
at each substrate DNA concentration range stated for 
both assays. The Km and Vmax values for DNA demethy- 
lase activity were determined from double reciprocal 
plots of velocity versus substrate concentration. 

20 Measurements of methanol production catalyzed by 
demethylase by gas chromatography 

Gas chromatography was performed with a Varian™ 

model 3400 GC equipped with a 30m Stabilwax™ column 

(0.053 cm i.d.: Restek Corporation). Nitrogen™ was 

25 used as carrier gas at a flow rate of 32 ml/min, the 
injector and detector chambers were at 200 and 300°C 
respectively. The column was maintained at 4 0°C for 5 
minutes after sample injection. 

The demethylase reaction was performed in eppen- 

30 dorf tubes kept within sealed scintillation vials with 
300 |al of water as aqueous phase (in radioactive trap- 
ping experiments this was replaced by 300 (il of metha- 
nol) . The demethylase reaction was initiated in buffer 
L (10 mM MgCl 2 , 10 mM Tris-HCl pH 8.0) with 500 ng of 

35 tritiated SK plasmid (6000 dpm/|al) and 100 )il of 
demethylase at 37°C. After overnight incubation at 37°C, 
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the aqueous phase surrounding the eppendorf tube was 
transferred to a fresh eppendorf tube, 2 ul of this 
mixture was injected in the gas chromatography using a 
gas tight syringe (Hamilton, Reno, Nevada) . 
5 Coupled in vitro transcription translation 

The mRNAs encoded by the pcDNA 3.1/His Xpress 
demethylase constructs described above were transcribed 
and translated by coupled transcription-translation 
using Promega™ TNT reticulocyte lysate kit (according 

10 to manufacturer's protocol), 2 /xg of each construct and 
40jiCi of [ 35 -S] methionine (1 , OOOCi/mmol , Amersham) in a 
50/zl reaction volume. To purify non labeled in vitro 
translated demethylase, coupled in vitro transcription 
and translation was performed as above but in the pres- 

15 ence of cold methionine. The translation products were 
bound to a Probond™ nickel column (Invitrogen) and 
demethylase was eluted according to the manufacturer's 
protocol with increasing concentrations of imidazole. 
Demethylase is eluted at 350-500mM imidazole. The imi- 

20 dazole eluted demethylase was dialyzed and concentrated 
by lyophilization. 

Gas chromatography coupled with Mass spectrometry (GC- 
MS) Analyses for identification of volatile product of 
25 demethylase catalyzed reaction as methanol 

The demethylation reactions (volume 50 1) were 
run in conical vials having a total internal volume of 
350 microlitres. The vials were closed with a teflon- 
lined screw cap and left at room temperature for 18 h. 

3 0 The vials were cooled in an ice bath, opened and 10 mg 
of NaCl and 50 microlitres of toluene were added. The 
vials were frequently shaken over a period of 1 h. The 
toluene phases were pipetted into clean vials in a man- 
ner to rigorously exclude water carry over. Anhydrous 

35 sodium sulfate (5 mg) was added to the toluene extracts 
to remove water, and the toluene phases were pipetted 



WO 99/24583 PCT/CA98/01059 



into autoinjector vials for GC/MS analysis. Aliquots 
of 3 microlitres were analyzed under the following 
instrumental conditions : Instrument : Hewlett-Packard 
5988A; Column: 30 m x 0.25 mm i.d. fused quartz capil- 
5 lary with 0.25 micron DB-1 liquid phase, programmed 
after an initial hold for 1 min at 70 deg at 5 deg/min 
to 80 deg, then ramped ballistically to 280 deg for 
bake-out for 5 min; Injector and interface tempera- 
tures: 250 deg; Helium flow rate 1.5 ml/min; Mass 
10 spectrometer: ion source 200 deg, 70 eV electron impact 
ionization, scanning from m/z 10 to 50 in full scan 
mode was begun 6 s after injection, and ceased at 1.5 
min to avoid acquisition of the intense toluene solvent 
peak. 

15 

Human A549 cells bear a demethylase activity that could 
be purified away from dCTP and DNA MeTase 

The use of an appropriate cellular source and a 

direct assay for demethylase activity are obviously 

20 critical. As we have previously shown that demethylase 
activity was induced in response to ectopic expression 
of the Ras oncogene (Szyf et al., 1995) we reasoned 
that cancer cells might bear high levels of demethy- 
lase activity. Based on preliminary studies demon- 

25 strating the presence of high levels of demethylase 
activity in the human lung carcinoma cell line A54 9, we 
have chosen this cell line for our further studies and 
purification steps. Previous studies have used indi- 
rect measures such as increased sensitivity to methyla- 

30 tion- sensitive restriction enzymes as indicators of 
demethylase activity (Weiss et al . , 1996; Jost et al . , 
1995) . To directly measure the conversion of 5-mdCMP 
in DNA to dCMP, we have utilized a completely methyl- 
ated 32 P labeled [mdC 32 pdG] n double stranded oligomer 

35 which we had previously described (Szyf et al . , 1995). 
Following incubation with the different fractions, the 
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DNA is purified and subjected to cleavage with microco- 
cal nuclease to 3' mononucleotides. The 3' labeled 
mdCMP and dCMP are separated by thin layer chromatogra- 
phy (TLC) and the conversion of mdCMP to dCMP is 
5 directly determined. This assay provides a stringent 
test for bona fide demethylation and discriminates it 
from previously described BmCpC replacement activities 
(Jost et al., 1995; Weiss et al . , 1996). The glyco- 
sylase-demethylase activity described by Jost et al . 

10 (Jost et al., 1995) will require the presence of a 
ligase activity and an energy source for replacement of 
mdC with C to be detected by our assay, whereas the 
demethylase activity described by Weiss et al . will not 
be detected since it replaces the intact mdC 32 pdG dinu- 

15 cleotide with a cold dCpdG without altering its state 
of methylation (Weiss et al., 1996). 

Nuclear extracts were prepared from A549 cells, 
applied onto a DEAE-Sephadex column, eluted with a lin- 
ear gradient from 0.2-5.0M NaCl and the fractions were 

20 assayed for demethylase (dMTase) activity as described 
in the experimental procedures. As shown in Fig. 1(A) 
a clear peak of dMTase activity is eluted at the high 
salt fraction 10. 

Conversion of methylated cytosine to cytosine: 

25 Nuclear extracts prepared from A549 cells (1.1 mg) were 
passed through an AMICON™ 100 spin column. The retain- 
ant (98.56 mg, 0.2 mg/ml) was loaded onto a DEAE-Sepha- 
rose column, the different chromatographic column frac- 
tions eluted by a linear NaCl gradient (0.2-5M) were 

3 0 desalted and (30 fil) incubated with 1 ng of [mdC 32 pdG] n 
double stranded oligomer for 1 hour at 37 °C, digested 
to 3' mononucleotides and analyzed on TLC as described 
in the experimental procedures. Control methylated 
(ME) and nonmethylated (NM) [dC 32 pdG] n substrates were 

35 digested to 3' mononucleotides and loaded on the TLC 



WO 99/24583 



- 24 - 



PCT/CA98/01059 



plate to indicate the expected position of dCMP and 
mdCMP. The active fraction is indicated by an arrow. 
This fraction was loaded on S-Sepharose followed by Q- 
Sepharose and DEAE-Sephacel fractionation. 
5 The first chromatography step purified the 

dMTase activity from the bulk of nuclear protein 
(Fig. IB) and is a very effective purification step. 

DNA dMTase activity as measured by the release 
of volatile methyl residues. The different column 

10 fractions were incubated with lng (4 x 10 6 dpm/^g) of 
[ 3 H] -CH 3 - [mdCpdG] n oligomer and the release of volatile 
methyl residues was determined (-) and presented as 
total dpn) . The results are an average of three inde- 
pendent determinations. Protein concentration was 

15 determined using the Bio-Rad Bradford kit (-) . The 
elution profile of 20 jzM of [ 32 P] -a-dCTP incubated with 
the protein was determined by scintillation counting of 
the different DEAE fractions (-) and presented as frac- 
tion of dCTP loaded on the column. 

20 To exclude the possibility that the DNA dMTase 

activity detected in our assay is carried by the DNA 
MeTase, we assayed the fractions for DNA MeTase activ- 
ity using a hemimethylated DNA substrate as previously 
described (Szyf et al . , 1991). As observed in Figure 

25 IB DNA MeTase activity is detected in the second and 
third fractions, thus our fractionation separated DNA 
dMTase away from the DNA MeTase suggesting that they 
are independent proteins. 

There is a remote possibility that the demeth- 

30 ylation observed is not a bona fide demethylation but 
a consequence of a glycosylase removal of mC, followed 
by removal of the remaining deoxyribose-phosphate by AP 
(apyrimidine) nuclease, repair of the gap catalyzed by 
DNA polymerase using trace dCTP contained in the frac- 

35 tion and ligation of the break with ligase in the pres- 
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ence of residual ATP. For this hypothesis to be con- 
sistent with our data, four independent enzymes and two 
cof actors have to cof ractionate with DNA dMTase . To 
exclude the possibility that a trace amount of dCTP is 
5 bound to DNA dMTase active fraction, we have added 2 0 
/xM of 32 P labeled dCTP (10x10 s cpm) to the nuclear 
extract and determined its elution profile on the DEAE 
column. Less than background cpm (10 cpm) were 
detected in the DNA dMTase active fraction suggesting 

10 that our first column purifies dCTP away from the DNA 
dMTase at least IxlO 6 fold (Fig. IB) . If any dCTP is 
present in the nuclear extract, the remaining concen- 
tration after fractionation on DEAE is well below the 
Kms of the known DNA polymerases. The possibility that 

15 dCTP is so tightly bound to the enzyme that it could 
not be replaced by the exogenous 32 P labeled dCTP is 
very remote since an enzyme using dCTP as substrate 
must readily exchange dCTP. 

The active fraction 10 was further fractionated 

20 sequentially on the following columns: S-Sepharose and 
Q-Sepharose. The DNA dMTase eluted at the high salt 
fraction from both columns as determined by the 
[mdC 32 pdG]n demethylat ion assay (Fig. 1A) . The ion 
exchange chromatography was followed by chromatography 

25 on DEAE-Sephacel . 

The fact that we have maintained our activity 
even after 4 fractionation steps (Table 1) and that 
only a single polypeptide is apparent after the last 
purification step argues strongly against the possibil- 

30 ity that the activity detected in our study is a repair 
or replacement activity. Any replacement mechanism 
must involve a number of proteins and additional cofac- 
tors and substrates. In summary, the chromatography of 
the demethylase activity in A459 cells provides strong 
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support to the hypothesis that mammalian cells bear a 

bona fide demethylase activity. 

DNA dMTase releases a volatile derivative 

A Jbona fide demethylation has to result in 
5 release of the methyl group as a volatile derivative 
such as C0 2/ methanol, methane or formaldehyde. We 
have therefore incubated a { [ 3 H] -CH 3 -dCpdG}n double 
stranded oligonucleotide with the different column 
fractions and the rate of release of the tritiated 
10 methyl from the aqueous phase was determined by scin- 
tillation counting of the remaining radioactivity in 
the reaction mix. As demonstrated in Fig. lb (dia- 
mond) , the dMTase active fractions release labeled 
methyl groups from the methylated substrate. 

15 

DNA dMTase is a protein which is inhibited by RNA, does 
not involve an exchange activity and does not require 
additional cofactors 

DNA dMTase activity measured either as transfor- 

20 mation of mdC to C (Fig. 2a) or as release of volatile 
methyl residues (Fig. 2c) is abolished after proteinase 
K treatment and is not inhibited but rather enhanced 
following RNase treatment. 500 /zM of ddCTP which 
inhibits DNA polymerase does not inhibit demeth- 

25 ylation of the [mdC32pdG]n substrate, nor is it inhib- 
ited by high concentrations of methyl-dCTP (500 /xM) 
(Fig. 2a) , which is consistent with the hypothesis that 
demethylation does not involve an excision and replace- 
ment mechanism. If a replacement mechanism is involved 

30 in demethylation, the presence of mdCTP should result 
in incorporation of methylated cytosines and essential 
inhibition of demethylation. Thus, the DNA dMTase 
identified here is a protein and not an RNA and is une- 
quivocally different from the previously published RNA 

35 based or glycosylase based demethylase activities. 
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The DNA dMTase reaction proceeds without any 
requirement for additional substrates such as dCTP, 
redox factors such as NADH and NADPH or energy sources 
such as ATP (data not shown) . As observed in Fig. 2b 
5 and 2d, the DNA dMTase reaction maintains its initial 
velocity up to 90 minutes and continues up to 120 min- 
utes. This time course is inconsistent with dependence 
on enzyme -bound additional nonreplenishable substrates 
such as dCTP or ATP or a nonreplenishable redox factor 
10 such as NADH or NADPH. Exhausting the nonreplenish- 
able substrate or redox factor would have resulted in 
rapid deceleration of the initial velocity. 

A product of the demethylation reaction is deoxyCyto- 
15 sine in DNA 

What is the product of the demethylation reac- 
tion? The results presented above (Fig. la, 2a and b) 
based on a one dimension TLC separation show that DNA 
dMTase generates dC from mdC in DNA. To further sub- 

20 stantiate this conclusion, we subjected DNA dMTase 
treated DNA to remethylat ion with the CpG MeTase M.Sss 
I which can transfer a methyl group exclusively to dC. 
The results presented in Fig. 3a show that the demeth- 
ylated product of DNA dMTase is dC since it is com- 

25 pletely remethylated with M.Sss I. The identity of the 
demethylated product as dC was further established by a 
two-dimension TLC analysis demonstrating that the prod- 
uct of dMTase comigrates with a cold dCMP standard in 
both dimensions (Fig. 3b).. 

3 0 DNA dMTase does not release a nucleotide, a 

phosphorylated base or phosphate from methylated DNA 
when incubated with a [32pmdCpdG]n substrate which 
included a labeled 32P 5' to mdC or our standard meth- 
ylated substrate (Fig.l) where 32P is 3' to the m5dC 

35 (Fig. 3c) . Nuclear extracts which obviously contain a 
number of glycosylases and nucleases release phospho- 
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rylated derivatives in the same assay (Fig. 3c) . 
dMTase transforms the methyl cytosine in the 
[32pmdCpdG]n substrate to cytosine as demonstrated 
when the reacted DNA is digested to 5' mononucleotides 
5 (Fig. 3c +V PDS) and analyzed by TLC. Since this 

reaction does not involve release of a 32P derivative 
(Fig. 3c -V PDS), it demonstrates that dMTase trans- 
forms methylated cytosines to cytosines on DNA without 
disrupting the integrity of the DNA substrate by glyco- 
10 sylase or nuclease activity . 

The second product of the dMTase reaction is methanol 

What is the identity of the leaving group? The 
results presented in Figlb suggest that the labeled 

15 methyl leaves the DNA as a volatile compound. The 
demethylase reaction involves release of the methyl 
group per se whereas the cytosine base ring remains in 
the aqueous phase. Fig. 4a demonstrates this point by 
using a methylated plasmid labeled with a 3 H-hydrogen 

20 at the sixth position of cytosine and [14C] -methyl at 
the fifth position of cytosine as a substrate. 

The three most obvious candidates the methyl 
group is leaving as are formaldehyde, carbon dioxide, 
and methanol. Methadone trapping for labeled formalde- 

25 hyde detection and sodium hydroxide trapping for 
labeled carbon dioxide detection were both negative in 
identifying the form in which the methyl group is leav- 
ing in the dMTase reaction (data not shown) . The other 
possible chemical form that the methyl group may leave 

30 the DNA as, is methanol. Since methanol is a volatile 
compound, a simple method to measure generation of 
methanol is a scintillation-volatilization assay (see 
Fig. 4b for description) . Volatilization assays have 
been previously used to measure release of methanol in 

35 demethylation reactions. The demethylation reaction 
mix containing the labeled { [ 3 H] -CH 3 -dCpdG}n substrate 
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with either dMTase or no enzyme, as a control, is added 
to an uncapped 0 . 5 ml tube which is placed in a sealed 
scintillation vial containing scintillation fluid. 
Released methanol is volatile, diffuses out of the open 
5 reaction tube and is mixed with the excess of the scin- 
tillation fluid in the vial registering as counts in 
the scintillation counter. As a control indicating 
that methanol is volatilized under the conditions of 
our assay, we incubated approximately equal counts of 

10 radioactively labeled methanol under the same condi- 
tions and measured the counts in a scintillation coun- 
ter at different time points. As observed in Fig. 4c 
the majority of methanol in the reaction tube volatil- 
izes from the reaction tube into the scintillation 

15 fluid following an overnight incubation at 3 7°C. The 
experiment shown in Fig. 4b demonstrates that volatil- 
ized label is released from methylated DNA only in the 
presence of dMTase. 

The identity of the volatile group has been 

2 0 determined to be methanol by a gas chromatography (GC) 
analysis. The demethylation and control reactions 
(indicated in Fig. 4e) were performed in an uncapped 
tube placed in a sealed scintillation vial containing a 
larger volume (300/zl) of water. The volatile residue 

25 diffuses into the surrounding water and mixes with it. 
A 2 jil sample of the surrounding water was injected 
into a GC column as described in the methods. As 
shown in Fig. 4e, the volatile compound released by 
dMTase in a dose response manner coelutes with met ha - 

30 nol. Release of methanol is observed only in the pres- 
ence of both dMTase and methylated DNA. No methanol is 
released when dMTase is reacted with nonmethylated DNA, 
demonstrating that methanol is a product of demethyla- 
tion of DNA. 
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The leaving group was also identified as metha- 
nol using gas chromatography coupled with Mass spec- 
trometry (GC-MS) . As illustrated in Fig. 4f., incuba- 
tion of methylated DNA with dMTase (dMTase+ME-DNA) 
5 results in release of a peak with the retention time 
and mass spectrum (peaks are identified at 32 and 2 9 
atomic mass which are the atomic masses of methanol and 
ionized methanol respectively) which is consistent with 
its identification as methanol. Incubation of dMTase 

10 with nonmethylated DNA does not release methanol indi- 
cating that methanol is a product of the demethylation 
reaction. No methanol is released when the samples are 
incubated with dMTase treated with protease K indicat- 
ing that the release of methanol from methylated DNA is 

15 catalyzed by an enzymatic activity. 

Demethylation involves transfer of a hydrogen from 
water to regenerate cytosine 

If demethylation involves removal of the methyl 

2 0 moiety from mdC, a hydrogen has to be transferred to 

the carbon at the 5 ' position to regenerate cytosine . 
Since no redox factors are involved, what is the source 
of the hydrogen? To test the hypothesis that the 
source of the hydrogen is water, we incubated either 

25 non labeled [mdCpdG] n or [dCpdG] n double stranded DNA 
with DNA dMTase for different time periods in the 
presence of tritiated water, following which the DNAs 
were digested to 3' dNMPs, separated on TLC with non- 
radioactive standards for each of the 5 possible dNMPs 

30 and exposed to a tritium sensitive phosphorimaging 
plate. As seen in Fig.4d, dMTase catalyzes the trans- 
fer of a tritiated hydrogen from water to dCMP in meth- 
ylated DNA in a time dependent manner only when meth- 
ylated DNA is used as a substrate. Based on the 

3 5 experiments described in Fig. 3 and 4 we propose that 

dMTase catalyzes the exchange of the methyl group at 
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the 5' position of cytosine in DNA with hydrogen from 
water and the methyl group reacts with the remaining 
hydroxyl group to form methanol (Fig. 5) . 

5 Substrate and sequence specificity of DNA dMTase 

Methylation of CpG dinucleotides is the most 
characterized modification occurring in genomic 
DNA8,48. The results presented in Fig. 6 demonstrate 
that DNA dMTase is a general DNA dMTase activity that 

10 demethylates fully or hemimethylated dCpdG in DNA 
flanked by a variety of sequences which are distributed 
at different frequencies, but does not demethylate 
methylated adenines or methylated cytosines that do not 
reside in the dinucleotide CG. First, as shown in 

15 Fig. 6a, a plasmid DNA methylated in vitro at all dCpdG 
sites with M.Sss I and all d*CdCdGdG sites with M. Msp 
I (which methylates the external C in the sequence 
*CCGG, thus enabling the determination of demethylation 
at the CC dinucleotide) and in vivo with the E. coli 

20 DCM MeTase at dCmdCdA/dTdGdG sites and with the DAM 
MeTase at dGmdAdTdC sites (adenine methylated) was 
treated with dMTase and the state of methylation of the 
plasmid was determined using the indicated methylation 
sensitive restriction enzymes. dMTase demethylates C*G 

25 methylated sites as indicated by the sensitivity of the 
dMTase" treated plasmid to Hpa II and Hha I but does not 
demethylate C*C,C*A or C*T methylated sites as indi- 
cated by the resistance to Msp I and Eco RII restric- 
tion enzymes, or adenine methylation as indicated by 

30 its sensitivity to Dpn I. Second, bisulfite mapping 
analysis of methylation of 5 methylated C*G sites 
residing in a M.Sss I in vitro methylated pMetCAT plas- 
mid following dMTase treatment shows that all C*G sites 
are demethylated irrespective of their flanking 

35 sequences thus excluding the possibility that demeth- 
ylation is limited to CCGG or CGCG sequences (Fig. 6b) . 
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Third, dMTase does not demethylate two fully methylated 
cytosine bearing oligomers [dmC32pdA] n, [mdC32pdT] n 
demonstrating that mdCpdA and mdCpdT are not demethyl- 
ated by DNA dMTase (Fig. 6d) . Fourth, dMTase demethyl- 
5 ates a hemimethylated synthetic substrate 
[dCpdGln* [mdC32pdG]n (Fig. 6d) . Demethylation of SK is 
complete under these conditions (Fig. 6a) whereas 
demethylation of a methylated [mdCpdG] n substrate is 
not complete under the same conditions (Fig. 6d) . This 

10 can reflect differences in the sequence composition of 
the substrate and the frequency of methylated cyto- 
sines. The [mdCpdG] n contains on average 16 fold more 
methylated cytosines per molecule than plasmid DNA. 
Alternatively, these differences might reflect discrep- 

15 ancies in the assays used, restriction enzyme digestion 
versus a nearest neighbor analysis. To address this 
discrepancy we have labeled a fully methylated SK plas- 
mid with [a 32 P]dCTP, 5 -methyl-dCTP and the other dNTPs, 
subjected it to dMTase treatment and digested it to 

20 mononucleotides at different time points following the 
initiation of the reaction and subjected the samples to 
a TLC analysis. As shown in Fig. 6c, the SK plasmid is 
fully demethylated at 3 hours which is consistent with 
the results obtained with methylation sensitive 

25 restriction enzymes (Fig. 6a) . 

The Km of DNA dMTase for hemimethylated and 
fully methylated DNA was determined by measuring the 
initial velocity of the reaction at different concen- 
trations of substrate (Table 2) . The calculated Km for 

30 hemimethylated DNA is 6 nM which is two fold higher 
than the Km for DNA methylated on both strands, 2.5-3 
nM (Table 2) . It is unclear yet whether this small 
difference in affinity to the substrate has any sig- 
nificance in a cellular context. Thus similar to the 

35 DNA MeTase DNA dMTase shows dinucleotide sequence 
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selectivity but in difference from DNA MeTase which 
shows preference to hemimethylated substrates dMTase 
prefers fully methylated DNA which is consistent with a 
role for DNA dMTase in altering established methylation 
5 patterns . 

Table 1 
Purification of DNA dMTase 



Purification step 


Total 
protein 

H) 


Total dpm 


pMole/pg 


pMole/pg/h 


Fold 
Purification 


Nuclear extract 


6000 


1107.2 


5.5 x 10" 5 


1.833 x 10* 




DEAE-Sephadex 


3.75 


5844 


0.4674 


0.156 


8445.5 


SP-Sepharose 


0.77 


5106 


1.989 


0.663 


35939.84 


Q-Sepharose 


0.46 


5335 


3.4 


1.13 


62860.65 


DEAE-Sephacel 


0.018 


1834 


30.57 


10.19 


552243.2 



10 Table 2 



Kinetic 


parameters for DNA 


dMTase 


Method 


K„ (DNA) 


V max (pMole/h) 


Methylated oligo CpG 


2.5 nM 


340 


Hemi-methylated CpG 


6.0 nM 


402 


Methylated SK-DNA 


3.3 nM 


40.42 



Cloning and construction of demethylase expression 
vectors 

15 PCR amplification of the MBD domain of the putative 
demethylase candidate cDNA 

One fig of total RNA prepared from the human 
small lung carcinoma cell line A549 was reverse tran- 
scribed using Superscript reverse transcriptase and 
20 random primers (Boehringer) in a 25 fil reaction volume 
according to conditions recommended by the manufacturer 
(GIBCO-BRL) . Five /xl of reverse transcribed cDNA were 
subjected to an amplification reaction with Taq poly- 
merase (Promega, 1 unit) using the following set of 
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primers : sense 5 ' CTGGCAAGAGCGATGTC 3 ' SEQ ID NO : 9 , 
antisense 5 ' AGTCTGGTTTACCCTTATTTTG 3* SEQ ID NO: 10. 

Amplification conditions were: step 1. 95°C 1 
min.; step 2: 94°C 0.5 min; step 3: 45°C 0.5 min.; step 
5 4: 72°C 1.5 min; steps 2-4 were repeated 30 times. 
MgCl 2 was adjusted to 1 mM according to conditions rec- 
ommended by the manufacturer. The PCR products were 
cloned in pCR2 . 1 vector (InVitrogen) and the sequence 
of the cDNAs was verified by dideoxy- chain termination 

10 method using a T7 DNA sequencing kit (Pharmacia) . The 
amplified fragment was excised from the plasmid with 
EcoRI, labeled with a Boehringer random prime labeling 
kit according to manufacturer's protocol and alpha 32 P- 
dCTP. The labeled probe was used to screen a HeLa cell 

15 cDNA library in XTriplEx phage (Clontech) according to 
standard procedures. Positive clones were identified 
and further purified by serial dilutions for 4 rounds. 
The insert in the pTriplEx plasmid was excised from the 
phage according to manufacturer's protocols and the 

20 identity of the insert was verified by sequencing. The 
insert was excised by NotI restriction and subcloned 
into either the inducible expression vector: Retro tet 
on (Clontech) in the sense and antisense orientation or 
the pcDNA3.l/His Xpress vector in all three frames and 

25 in the antisense orientation. 

Transf ection and expression of demethylase in verte- 
brate cells 

Ten /ig of either Retro tet on demethylase or 
30 pcDNA 3.1/His Xpress demethylase are mixed with 8 /il of 
transf ection lypophilic reagent Pfx-2 (Invitrogen) and 
placed upon 100,000 mouse (3T3 Balb/c, human (A549) or 
monkey cells (CV-1) according to manufacturer's proto- 
col in OPT I MEM medium for 4 hours. Cells are harvested 
35 after 48 hours and demethylation and demethylase activ- 
ity is determined by measuring total genomic DNA meth- 
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ylation using standard techniques or a cotransf ected in 
vitro methylated plasmid using a Hpall /Mspl restric- 
tion enzyme analysis. Cellular transformation is meas- 
ured by a soft agar assay. 

5 

Demethylation of pBluescript SK(+) Plasmid 

About 4 /xg plasmid pBluescript SK (Stratagene) 
was subjected to methylation using SssI methylase. The 
methylated plasmid (4 ng) was incubated for different 

10 time points as indicated with 3 0 jul of DNA dMTase 
Fraction 4 of DEAE-Sephacel™ column under standard con- 
ditions, extracted with phenol: chloroform and precipi- 
tated with ethanol. About 1 ng of the plasmid were 
subjected to digestion with 10 units each of either of 

15 the restriction endonuclease EcoRII (GIBCO-BRL) , Dpnl, 
or Hpall (New England Biolabs) before and after meth- 
ylation as well as after DNA dMTase treatment in a 
reaction volume of 10 fil for 2 hour at 37°C. Following 
restriction digestion the plasmids were extracted with 

20 phenol : chloroform, ethanol precipitated and resuspended 
in 10 fil. The plasmids were electrophoresed on a 0.8% 
(w/w) Agarose gel, transferred onto a Hybond™ Nylon 
membrane and hybridized with pBluescript SK(+) plasmid 
which was 32 P labeled by random-priming (Boehringer 

25 Mannheim) . 

dMTase activity coelutes with a -45 KDa polypeptide 
when sized under denaturing conditions but migrates as 
a higher molecular weight complex under non denaturing 

30 conditions. dMTase was purified up to 500, 000 fold by 
four chromatographic steps (Table 1) . We first deter- 
mined the identity of the polypeptide associated with 
dMTase activity by SDS-PAGE analysis of the active 
fractions . As observed in Fig . 7a, a cluster of 4 

35 polypeptide bands from -44 KDa to 35 KDa coelute with 
dMTase activity in the last two chromatographic steps 
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(the lower fragment might be a degradation product as 
evidenced by its abundance in the later chromatographic 
steps) . However when the active DEAE-Sephacel fraction 
is size fractionated on a 4% non denaturing acrylamide 
5 column, the dMTase activity elutes at the high molecu- 
lar weight of -170 KDa (Fig. 7c, fraction 63) . SDS- 
PAGE analysis of this fraction (63) reveals only two 
bands (Fig. 7b) observed in the active chromatographic 
fractions (Fig. 7a) To further determine whether 

10 dMTase is found in a multimeric complex, fraction 63 
was size fractionated on a glycerol gradient (Fig. 7d) 
and DNA dMTase activity eluted at the -170 kDa range. 
As only two main small polypeptides were identified in 
fraction 63 (approximately 35-43 KDa) , dMTase is proba- 

15 bly found in either a homomeric complex if only one of 
the two peptides is dMTase or a heteromeric complex if 
both polypeptides are associated with dMTase activity. 

a. Identification of a lead DNA dMTase candidate by 
20 homology search of dbEST 

As the purification of dMTase suggests that the 

dMTase is of very low abundance, only -19 ng of dMTase 

could be isolated from 6 mg of nuclear extract 

(Table 1) , we opted for cloning the dMTase based on its 

25 following functional properties. First, since dMTase 
specifically demethylates methylated CG dinucleotides , 
we assumed that it should bear the ability to recognize 
methylated CG dinucleotides. Second, the demethylase 
transforms methylated cytosine in DNA to cytosine. 

30 Third, the demethylase releases the methyl group as a 
volatile compound. 

Previous reports have shown that proteins inter- 
acting with methylated DNA share a common domain 
(MDBD) . A TBLASTN search of the dbEST database identi- 

35 fied a novel . expression tag cDNA (from a T-cell lym- 
phoma Homo sapiens cDNA 5' end) (gb/AA361957/AA361957 
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EST71295) and the mouse homologue ( (gb/W97165/W97165 
mf90g05.rl) from Soares mouse embryo NbME13.5) with 
unknown function that bears homology to the MDBD 
(Fig. 8a) . A search of the GenBank database verified 
5 that it is a novel cDNA that has not been included in 
GenBank. Alignment of the novel EST and MeCP2 and 
MeCPl associated protein has revealed no homology 
beyond the previously characterized MDBD which is con- 
sistent with a different function for this methylated 

10 DNA binding protein. A 201bp fragment bearing the 
sequence identified in the search was reverse tran- 
scribed and amplified from human lung cancer cell line 
A54 9 RNA and was used to screen a cDNA library from 
Hela cells. The largest insert cloned was of 1.36 kb 

15 size and its sequence identity with the EST sequence 
was determined. The cDNA is novel and has no homologue 
in GenBank and no function has ever been assigned to 
it. A virtual translation of the protein identified an 
open reading frame (ORF) of 262 amino acids (Fig. 8b). 

20 The ORF may extend further 5' as no in frame stop codon 
was found upstream of this ATG. However, RACE analy- 
ses and further searches of the dbEST have failed to 
identify 5' sequences upstream to the one identified in 
our screening . 

25 A BLAST search of the candidate protein using 

the Predict protein server against a database of pro- 
tein domain families has identified only the MDBD 
domain and found no homologue to the sequence in the 
data base search. No other functional motifs were 

30 identified by the Prosite analysis. This is consistent 
with a novel biochemical function for this protein. A 
coiled coil prediction of the sequence identified a 
coiled coil domain which is known to play a role in 
protein protein interactions. 
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The identified cDNA encodes an mRNA that is 
widely expressed in human cells as revealed by a North- 
ern blot analysis of human poly A+ mRNA (Fig. 8c) as 
one major transcript of - 1 . 6 kb which is close to the 
5 size of the cloned cDNA, verifying that the cloned cDNA 
does not represent a highly repetitive RNA but rather a 
mRNA encoded by a single or low copy number gene. 

In vitro translated candidate cDNA bears dMTase activ- 
10 ity 

A conclusive proof for the existence of a single 
protein that bona fide demethylates DNA is to demon- 
strate that an in vitro translated candidate cDNA can 
volatilize methyl groups from methylated DNA and trans- 

15 form a methyl cytosine to cytosine in an isolated sys- 
tem. The candidate dMTase cDNA was subcloned it into a 
pcDNA3.l/His Xpress (INVITROGEN) expression vector in 
the putative translation frame (pcDNA3.1His A) and in a 
single base frame shift (pcDNA3.1His B) , and was in 

20 vitro transcribed and translated in the presence of 
35 S-methionine and the resulting translation products 
were resolved by SDS-PAGE. Autoradiography revealed a 
~40KDa protein (Fig. 10a) . The apparent size of the in 
vitro translated protein is shorter by -3-5 KDa from 

25 the apparent size of the purified protein. The cloned 
cDNA might be missing some upstream amino acids as dis- 
cussed above or might be differently modified in human 
cells . 

Two tests established whether the in vitro 
30 translated candidate cDNA is a bona fide dMTase. We 
first tested whether in vitro translated protein 
(purified on a Ni2+ charged agarose resin) can volatil- 
ize and release methyl residues in [ 3 H] -CH 3 -DNA using a 
radioactive trapping volatilization assay. To verify 
35 that the volatilized counts are true 3 H counts, a spec- 
trum analysis was performed. As demonstrated in Fig. 
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10b no volatilization of tritiated methyl residues is 
observed in the misframe dMTase (misframe) whereas in 
vitro translated putative dMTase cDNA catalyzes the 
volatilization of 3 H-CH 3 residues which are trapped in 
5 the scintillation cocktail. 

Second, in vitro translated dMTase cDNA trans- 
forms CH 3 -cytosine residing in [ 32 P] -ot-dGTP labeled 
plasmid DNA or in [methyl -dC32pdG] n double stranded 
oligomer DNA to cytosine, whereas a frame shift in 

10 vitro translated dMTase does not demethylate DNA (Fig. 
lOd) . This demonstrates that the dMTase activity is 
dependent on the dMTase translation product and not a 
contaminating activity found in the in vitro transla- 
tion kit that copurifies with the putative dMTase. The 

15 reaction carried out by the in vitro translated dMTase 
displays: dependence on the dose of in vitro translated 
product (Fig. 10c), time dependence (Fig. lOd) and 
dependence on translated protein (Fig. 10b & d mis- 
frame, Fig. 10c protease K treatment) . Taken together, 

20 these results strongly suggest that the cDNA cloned 
here codes for a jbona fide enzymatic DNA demethylase 
activity. 

Transiently transfected dMTase cDNA demethylates DNA 

25 dMTase cDNA and the pcDNA3.1HisC vector control 

were transiently transfected into human embryonal kid- 
ney cells to test whether the cDNA can direct expres- 
sion of dMTase activity in human cells. The His- tagged 
proteins were bound to Ni2+ agarose resin and eluted 

30 from the resin with increasing concentrations of imida- 
zole. The expression of the transfected dMTase was 
verified by a Western blot analysis (Fig. lib) . The 
imidazole fractions were assayed for their ability to 
volatilize and release methyl residues in [ 3 H] -CH 3 -DNA 

35 using a radioactive trapping volatilization assay 1. 
As observed in Fig. 11a, imidazole fractions from 
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dMTase transfected cells volatilize [ 3 H] -CH 3 whereas no 
tritiated counts are detected in DNA treated with imi- 
dazole fractions from cells transfected with a misframe 
mutation of dMTase or non transfected cells. The tran- 
5 siently expressed dMTase transforms methylated cytosine 
in DNA to cytosine residing in two different substrates 
(Figs. 11c & lid), in a protein dependent manner (Figs. 
11c & lie) , and the reaction displays substrate depend- 
ence and saturability (Fig. llf ) . Transiently 

10 expressed dMTase was loaded on a non denaturing glyc- 
erol gradient to determine its native MW. Similar to 
dMTase purified from human cells, cloned and purified 
dMTase activity fractionated at the 160-190 KDa range 
(data not shown) . This is consistent with self asso- 

15 ciation of cloned dMTase possibly mediated by the 
coiled-coil domain . 

Cloned DNA dMTase catalyzes a hydrolysis of 5 -methyl - 
cytosine to release methanol 

2 0 We determined the mechanism by which methyl 

residues are released by the cloned dMTase (from Fig. 
11) and compared it to the purified bona fide dMTase 
activity. Increasing amounts of non labeled [methyl - 
dCpdG] DNA were incubated with either the bona fide 

25 dMTase activity purified from A549 cells or the cloned 
dMTase in the presence of [ 3 H] water for 3 hours fol- 
lowed by digestion to mononucleotides, a thin layer 
chromatography and autoradiography. As Fig. 12a shows, 
both reactions replace the methyl group in 5-methylcy- 

30 tosine with a proton donated from water as indicated by 
the presence of [ 3 H] label in cytosine. 

The identity of the leaving methyl group in the 
demethylation reaction catalyzed by the purified bona 
fide dMTase activity was shown to be methanol. In 

35 order to identify the form that the methyl residue 
leaves as in the demethylation reaction catalyzed by 
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the cloned dMTase an identical gas chromatography/mass 
spectrometry analysis of the reaction products was per- 
formed as inl. Only the properly translated form of 
dMTase (both in vitro translated and transiently trans- 
5 fected and purified) is able to produce ions character- 
istic of methanol in a mass spectrometric analysis 
(mass of 32 and 29, Fig. 12b) . These results suggest 
that the demethylation reaction catalyzed by the cloned 
dMTase is hydrolysis of the 5 -methyl -cytosine to cyto- 
10 sine and methanol as described for the purified 
dMTase 1 . 

DNA dMTase activity is undetectable in nontrans formed 
cells 

The assays for dMTase activity described here 
15 and the cloning of DNA dMTase cDNA enables a study of 
its expression at different cellular states. Global 
hypomethylation of DNA is a common observation in can- 
cer cells. This has been a perplexing observation, 
since DNA MeTase activity is elevated in cancer cells. 

2 0 Hyperactivation of DNA MeTase has been proposed to play 

a role in cancer development. This paradox raises 
questions on the proposed role of the elevated levels 
of DNA MeTase in cancer cells. One simple explanation 
that has been previously suggested to resolve this 
25 paradox is that cancer cells express induced levels of 
DNA dMTase. We compared the DNA dMTase activity in 
equal concentrations of DEAE-Sephadex fractionated 
nuclear extracts (fractions 9-10) prepared from a num- 
ber of carcinoma cell lines H446, Colo 205, Hela, and 

3 0 A54 9 with a similar preparation from human skin fibro- 

blast cells at initial rate conditions using 
[mdC32pdG]n double stranded oligomer as a substrate. 
As observed in Fig. 13a, whereas DNA dMTase activity is 

r 

readily observed in all carcinoma cell lines, it is 

35 undetectable in nontransf ormed human cells. The 

absence of dMTase activity in human primary cells 
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reflects the situation in vivo since ciMTase activity is 
undetectable in preparations from different murine tis- 
sues whereas dMTase activity is present in a murine 
carcinoma cell line P19 that was transfected with the 
5 H-Ras protooncogene, or human tumors carried as xeno- 
grafts in the same strain of mouse (Fig. la: COLO 205, 
A549. Hela) . These conclusions were verified using the 
radioactive- trapping volatilization assay shown in Fig. 
13c. 

10 Since dMTase mRNA has been detected using a sen- 

sitive poly A+ Northern blot in all normal human tis- 
sues, we tested the hypothesis that the absence of 
detected dMTase activity in normal tissues reflects a 
quantitative difference in DNA dMTase mRNA between nor- 

15 mal tissues and cancer lines. A Northern blot analysis 
and quantification of dMTase mRNA by a slot blot analy- 
sis shown in Fig. 13d using total RNA supports this 
hypothesis. Whereas minute levels of dMTase mRNA are 
detected in normal tissues, high levels of dMTase are 

20 expressed in a murine carcinoma cell line Yl that bears 
a 30 fold amplification of Ha-ras. 

A second DNA demethylase dMTase2 identified in human 
and mouse 

cDNA sequences, predicted amino acid sequences, and 
25 GenBank accession numbers of both dMTasel and dMTase2 
from human and mouse are shown. We claim that the high 
level of identity of the two proteins (Figs 9c and e) 
suggests that the two proteins can perform the same 
function, DNA demethylation. The N- terminals of 
30 dMTasel and dMTase2 contain a Methylated DNA Binding 
Domain (MBD) and near their C-terminals is a coiled- 
coil domain, however the middle portions of the protein 
sequences have no homology to any know structural or 
catalytic motif. Importantly, their middle regions are 
35 still extensively homologous suggesting that the cata- 
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lytic site of the demethylase activity lies in this 
area on both proteins. 

Induced expression of DNA demethylase in the Antisense 
orientation inhibits tumorigenesis ex vivo 

5 To test the hypothesis that inhibition of DNA 

dMTase can inhibit tumorigenesis tetracycline inducible 

vectors carrying the human dMTasel cDNA in either the 

sense or antisense orientation were constructed and 

transiently transfected into HEK 293 cells, treated for 

10 48 hours either in the presence or absence of doxycy- 
cline (a tetracycline analogue) , selected for the last 
24 hours with puromycin, and then plated on soft agar 
and allowed to grow for seven days. After seven days 
colonies were scored and the data presented clearly 

15 show that doxycycline induced expression of the dMTasel 
cDNA in the antisense orientation reduced colony forma- 
tion (Fig. 15) . 

Imidazole is a small molecule inhibitor of DNA 
demethylase activity 

20 A template small molecule, imidazole, was tested 

for the ability to inhibit DNA dMTase activity. In a 
volatilization of radioactive methyl residues assay, 
concentrations from 1/zM to lOmM of imidazole were incu- 
bated in a typical volatilization of radioactive methyl 

25 residues as described above. The graph clearly demon- 
strates a dose dependent inhibition of DNA dMTase 
activity by imidazole, and validates a rationale for 
testing imidazole based molecules as inhibitors of DNA 
dMTase activity (Fig. 16). 

30 Identification of DNA demethylase cDNAs and protein 
sequences 

Fig. 9a illustrates cDNA sequence of human dMTasel (SEQ 
ID NO:l) and its predicted amino acid sequence (SEQ ID 
N0:2), including its Genbank location. Fig- 9b illus- 
35 trates cDNA sequence of human dMTase2 (SEQ ID NO: 3) and 
its predicted amino acid sequence(SEQ ID N0:4) , includ- 
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ing its GenBank location. Fig. 9c illustrates protein 
sequence alignment of human dMTasel and human dMTase2 . 
Fig. 9d illustrates cDNA sequence of mouse dMTasel (SEQ 
ID NO: 5) and its predicted amino acid sequence (SEQ ID 
5 N0:6), including its GenBank location. Fig. 9e illus- 
trates cDNA sequence of mouse dMTase2 (SEQ ID NO: 7) and 
its predicted amino acid sequence (SEQ ID NO: 8), 
including its GenBank location. Fig. 9f illustrates 
protein sequence alignment of mouse dMTasel and mouse 
10 dMTase2. 

While the invention has been described in con- 
nection with specific embodiments thereof , it will be 
understood that it is capable of further modifications 
and this application is intended to cover any varia- 

15 tions, uses, or adaptations of the invention following, 
in general, the principles of the invention and 
including such departures from the present disclosure 
as come within known or customary practice within the 
art to which the invention pertains and as may be 

20 applied to the essential features hereinbefore set 
forth, and as follows in the scope of the appended 
claims . 
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WHAT IS CLAIMED IS ; 

1. a DNA demethylase enzyme and/or homologue 
thereof having about 4 0 KDa, and wherein said DNA 
demethylase enzyme is overexpressed in cancer cells. 

2. A cDNA encoding a human demethylase which com- 
prises a sequence set forth in SEQ ID NOS:l and 3. 

3. a cDNA homologous to the cDNA of claim 2, 
wherein said cDNA encoding mouse demethylase set forth 
in SEQ ID NOS : 5 and 7. 

4 . The use of the expression of demethylase cDNA of 

claims 2 or 3 to alter DNA methylation patterns of DNA 
in vitro in cells or in vivo in humans, animals and in 
plants . 

5. The use of claim 4, wherein said demethylase 
cDNA expression is under the direction of mammalian 
promoters . 

6. The use of claim 5, wherein said promoter is 
CMV. 

7. The use of claim 4, wherein said demethylase 
cDNA expression is under plant specific promoters to 
alter methylation in plants and to allow for altering 
states of development of plants and expression of for- 
eign genes in plants. 

8. The use of claim 4, wherein said demethylase 
cDNA expression is in the antisense orientation to 
inhibit demethylase in cancer cells for therapeutic 
processes . 
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9. The use of claim 9, wherein expression of 
demethylase cDNA in mammalian cells is to alter their 
differentiation state and to generate stem cells for 
therapeutics, cells for animal cloning and to improve 
expression of foreign genes. 

10. The use of the expression of demethylase cDNA of 
claims 2 or 3 in bacterial or insect cells for produc- 
tion of large amounts of demethylase. 

11. The use of the expression of demethylase cDNA of 
claims 2 or 3 for the production of protein in verte- 
brate, insect or bacterial cells. 

12. The use of claim 11 for producing antibodies 
against demethylase . 

13 . The use of the sequence of demethylase cDNA of 
claim 2 as a template to design antisense oligonucleo- 
tides and ribozymes. 

14 . The use of the predicted peptide sequence of 

demethylase cDNA of claim 2 to produce polyclonal or 
monoclonal antibodies against demethylase. 

15 . The use of expression of cDNA of claim 2 or 3 in 
two hybrid systems in yeast to identify proteins inter- 
acting with demethylase for diagnostic and therapeutic 
purposes . 

16. The use of expression of cDNA of claim 2 or 3 in 
bacterial, vertebrate or insect cells to produce large 
amounts of demethylase for high throughput screening of 
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demethylase inhibitors for therapeutics and biotechnol- 
ogy and for obtaining the x-ray crystal structure. 

17. A volatile assay for high throughput screening 
of demethylase inhibitors as therapeutics and antican- 
cer agents which comprises the steps of: 

a) using transcribed and translated demethylase 
cDNA of claim 2 or 3 in vitro to convert methyl - 
cytosine present in methylated DNA samples to 
cytosine present in DNA and volatilize methyl 
group ; 

b) determining the absence or minute amount of 
volatilize methyl group as an indication of an 
active demethylase inhibitor. 

18. A volatile assay for the diagnostics of cancer 
in a patient sample which comprises the steps of: 

a) determining demethylase activity in patient sam- 
ples by determining conversion of methyl-cyto- 
sine present in methylated DNA to cytosine pres- 
ent in DNA and volatilization of the methyl 
group released as methanol; 

b) determining the presence or minute amount of 
volatilized methyl group as an indication of 
cancer in said patient sample. 

19. Use of an antagonist or inhibitor of DNA demeth- 
ylase of claim 1 or 2 for the manufacture of a medica- 
ment for cancer treatment, for restoring an aberrant 
methylation pattern in a patient DNA, or for changing a 
methylation pattern in a patient DNA. 

20. Use according to claim 19, wherein said antago- 
nist is a double stranded oligonucleotide that inhibits 
demethylase at a Ki of 50nM. 
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21. Use according to claim 20, wherein said oligonu- 
cleotide is fc m GC m GC m GC m G] . 

lG m CG m CG m CG m cJ n 

22. Use according to claim 19, wherein the inhibitor 
comprises an anti-DNA demethylase antibody or an 
antisense oligonucleotide of DNA demethylase or a small 
molecule. 

23. Use according to one of claims 19 or 22, wherein 
the change of the methylation pattern activates a 
silent gene. 

24. Use according to claim 23, wherein the activa- 
tion of a silent gene permits the correction of genetic 
defect . 

25. " Use according to claim 24, wherein said genetic 
defect is p-thalassemia or sickle cell anemia. 

26. Use of the demethylase of claim 1, for removing 
methyl groups on DNA in vitro. 

27. Use of the demethylase of claim 1 or its cDNA of 
claim 2, for changing the state of differentiation of a 
cell to allow gene therapy, stem cell selection or cell 
cloning . 

28. Use of the demethylase of claim 1 or its cDNA, 
of claim 2 for inhibiting methylation in cancer cells 
using vector mediated gene therapy. 

29. An assay for the diagnostic of cancer in a 
patient, which comprises determining the level of 
expression of DNA demethylase of claim 1 in a sample 
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from said patient, wherein overexpression of said DNA 
demethylase is indicative of cancer cells. 
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SEQUENCE LISTING 

<110> McGILL UNIVERSITY 
SZYF, Moshe 

BHATTACHARYA , San joy K. 
RAM CHAND AN I , Shyam 



<120> DNA DEMETHYLASE , THERAPEUTIC AND 
DIAGNOSTIC USES THEREOF 

<130> 1770-183"PCT" FC/ld 

<150> CA 2,220,805 
<151> 1997-11-12 

<150> CA 2,230, 991 
<151> 1998-05-11 

<160> 10 

<170> FastSEQ for Windows Version 3.0 

<210> 1 
<211> 1804 
<212> DNA 
<213> Unknown 

<400> 1 

ccgctctgcg ggcggggcgg gtctccggga ttccaagggc tcggttacgg aagaagcgca 60 

gagccggctg gggagggggc tggatgcgcg cgcacccggg gggaggccgc tgctgcccgg 120 

agcaggagga gggggagagc gcggcgggcg gcagcggcgc tggcggcgac tccgccatag 180 

agcagggggg ccagggcagc gcgctcgctc cgtccccggt gagcggcgtg cgcagggaag 240 

gcgctcgggg cggcggccgt ggccgggggc ggtggaagca ggcggcccgg ggcggcggcg 300 

tctgtggccg tggccgtggc cgtggccggg gtcggggccg tggccggggc cggggccggg 360 

gccgcggccg tccccagagt ggcggcagcg gccttggcgg cgacggcggc ggcggcgcgg 420 

gcggctgcgg cgtcggcagc ggtggcggcg tcgccccccg gcgggatcct gtccctttcc 480 

cgtcggggag ctcggggccg gggcccaggg gaccccgggc cacggagagc gggaagagga 54 0 

tggactgccc ggccctcccc cccggatgga agaaggagga agtgatccga aaatcagggc 600 

tcagtgctgg caagagcgat gtctactact tcagtccaag tggtaagaag ttcagaagta 660 

aacctcagct ggcaagatac ctgggaaatg ctgttgacct tagcagtttt gacttcagga 72 0 

ccggcaagat gatgcctagt aaattacaga agaacaagca gagactccgg aatgaccccc 780 

tcaatcagaa caagggtaaa ccagacctga acacaacatt gccaattaga caaactgcat 840 

caattttcaa gcaaccagta accaaattca cgaaccaccc gagcaataag gtgaagtcag 900 

acccccagcg gatgaatgaa caaccacgtc agcttttctg ggagaagagg ctacaaggac 960 

ttagcgcatc agatgtaaca gaacaaatta taaaaaccat ggagctacct aaaggtcttc 1020 

aaggagtcgg tccaggtagc aatgacgaga cccttctgtc tgctgtggcc agtgctttac 1080 

acacaagctc tgcgcccatc acaggacaag tctctgctgc cgtggaaaag aaccctgctg 1140 

tttggcttaa cacatctcaa cccctctgca aagctttcat tgttacagat gaagacatta 1200 

ggaaacagga agagcgagtc caacaagtac gcaagaaact ggaggaggca ctgatggccg 1260 

acatcctgtc ccgggctgcg gacacggagg aagtagacat tgacatggac agtggagatg 132 0 

aggcgtaaga atatgatcag gtaactttcg actgaccttc cccaagagca aattgctaga 1380 

aacagaatta aaacatttcc actgggtttc gcctgtaaga aaaagtgtac' ctgagcacat 1440 

agctttttaa tagcactaac caatgccttt ttagatgtat ttttgatgta tatatctatt 1500 

attccaaatg atgtttattt tgaatcctag gacttaaaat gagtctttta taatagcaag 1560 

cagggccctt ccggtgcagt gcagctttga ggccaggtgc agtctactgg aaaggtagca 1620 

cttacgtgaa atatttgttt cccccacagt tttaatataa acagatcagg agtaccaaat 1680 
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aagtttccca attaaagatt attatacttc actgtatata aacagatttt tatactttat 1740 
tgaaagaaga tacctgtaca ttcttccatc atcactgtaa agacaaataa atgactatat 1800 
tcac 1804 
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125 








Phe 


Pro 


Ser 


Glv 


Ser 


Ala 


Gly 


Pro 


Gly 


Pro 


Arq 


Gly 


Pro 


Arq 


Ala 


Thr 




130 










135 










140 










Glu 


Ser 


Glv 


Lvs 


Ara 


Met 


Asp 


Cys 


Pro 


Ala 


Leu 


Pro 


Pro 


Gly 


Trp 


Lys 


145 










150 










155 










160 


Lvs 


Glu 


Glu 


Val 


He 


Arq 


Lys 


Ser 


Gly 


Leu 


Ser 


Ala 


Gly 


Lys 


Ser 


Asp 










165 










170 










175 




Val 


Tyr 


Tyr 


Phe 


Ser 


Pro 


Ser 


Gly 


Lys 


Lys 


Phe 


Arg 


Ser 


Lys 


Pro 


Gin 








180 










185 










190 






Leu 


Ala 


Ara 


Tvr 


Leu 


Glv 


Asn 


Thr 


Val 


Asp 


Leu 


Ser 


Ser 


Phe 


Asp 


Phe 






195 










200 










205 








Arg 


Thr 


Gly 


Lys 


Met 


Met 


Pro 


Ser 


Lys 


Leu 


Gin 


Lys 


Asn 


Lys 


Gin 


Arg 




210 










215 










220 










Leu 


Arg 


Asn 


Asp 


Pro 


Leu 


Asn 


Gin 


Asn 


Lys 


Gly 


Lys 


Pro 


Asp 


Leu 


Asn 


225 










230 










235 










240 


Thr 


Thr 


Leu 


Pro 


He 


Arg 


Gin 


Thr 


Ala 


Ser 


He 


Phe 


Lys 


Gin 


Pro 


Val 










245 










250 










255 




Thr 


Lys 


Val 


Thr 


Asn 


His 


Pro 


Ser 


Asn 


Lys 


Val 


Lys 


Ser 


Asp 


Pro 


Gin 








260 










265 










270 






Arg 


Met 


Asn 


Glu 


Gin 


Pro 


Arg 


Gin 


Leu 


Phe 


Trp 


Glu 


Lys 


Arg 


Leu 


Gin 






275 










280 










285 








Gly 


Leu 


Ser 


Ala 


Ser 


Asp 


Val 


Thr 


Glu 


Gin 


He 


He 


Lys 


Thr 


Met 


Glu 




290 










295 










300 










Leu 


Pro 


Lys 


Gly 


Leu 


Gin 


Gly 


Val 


Gly 


Pro 


Gly 


Ser 


Asn 


Asp 


Glu 


Thr 


305 










310 










315 










320 


Leu 


Leu 


Ser 


Ala 


Val 


Ala 


Ser 


Ala 


Leu 


His 


Thr 


Ser 


Ser 


Ala 


Pro 


He 










325 










330 










335 




Thr 


Gly 


Gin 


Val 


Ser 


Ala 


Ala 


Val 


Glu 


Lys 


Asn 


Pro 


Ala 


Val 


Trp 


Leu 








340 










345 










350 






Asn 


Thr 


Ser 


Gin 


Pro 


Leu 


Cys 


Lys 


Ala 


Phe 


He 


Val 


Thr 


Asp 


Glu 


Asp 






355 










360 










365 
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He Arg Lys Gin Glu Glu Arg Val 

370 375 
Glu Ala Leu Met Ala Asp He Leu 
385 390 
Met Asp He Glu Met Asp Ser Gly 
405 



Gin Gin Val Arg Lys Lys Leu Glu 
380 

Ser Arg Ala Ala Asp Thr Glu Glu 
395 400 
Asp Glu Ala 
410 



<210> 3 

<211> 1589 

<212> DNA 

<213> Unknown 



<400> 3 

cacgcgcggg cgggtgggcg gagcggcccc cctagcgggg gctgtgaagc gcggggaggg 60 

ggccgagcgg gtggcgaagc cggcgcgcgc ccggctgggg gcggagggcg gaggcccgtg 120 

ggacagaaca gctgcggcga gtggcggcgg cggagggagc cgaatcggcg acgagcccgg 180 

gggtcgcaac ttgcagaagc ggcggcggcg gcggcatcgg ccacggcggg cggaaaagcc 240 

ggggcgcaat ggagcggaag aggtgggagt gcccggcgct cccgcagggc tgggaaaggg 300 

aagaagtgcc caggaggtcg gggctgtcgg ccggccacag ggatgtcttt tactatagcc 360 

ccagcgggaa gaagttccgc agcaagccac aactggcacg ttacctgggc ggatccatgg 420 

acctcagcac cttcgacttc cgcaccggaa agatgttgat gaacaagatg aataagagtc 4 80 

gccagcgtgt gcgctatgat tcttccaacc aggtcaaggg caagcctgac ctgaacaccg 54 0 

cgctgcctgt acggcagact gcatccatct tcaagcaacc ggtgaccaag atcaccaacc 600 

accccagcaa caaggtcaag agcgacccgc agaaggcagt ggaccagccg aggcagcttt 660 

tctgggagaa gaagctaagt ggattgagtg cctttgacat tgcagaagaa ctggtcagga 720 

ccatggactt gcccaagggc ctgcagggag tgggccctgg ctgtacagat gagacgctgc 780 

tgtcagccat tgcgagtgct ctacacacca gcaccctgcc cattacaggc cagctctctg 840 

cagccgtgga gaagaaccct ggtgtgtggc tgaacactgc acagccactg tgcaaagcct 900 

tcatggtgac agatgacgac atcaggaagc aggaggagct ggtacagcag gtacggaagc 960 

gcctggagga ggcactgatg gccgacatgc tagctcatgt ggaggagctt gcccgagacg 1020 

gggaggcacc actggacaag gcctgtgcag aggaggaaga ggaggaggaa gaggaggagg 1080 

aagagccgga gccagagcga gtgtagcaca ggtgccctgc ccaagtctgg gctgcagact 114 0 

gccttcagcc ttgcctggac caggtagggg ccagacctgt aggaggcagc cgtccacctc 1200 

ctttccaaag cctcctgctt ccaggtctca gtgcagggag cccctgtgga ccttgaactc 1260 

acttgtccct gcgctgcctg gcaggaagcc ccacactgaa agcagatgag cagtgaccca 1320 

actgagaggc cacctggaca cagtcacctc cctgcctcct tatcatagga caaggccttg 1380 

cttggcaccg aggagctggg agccgtgttg ggtgctggag gaagtttctg gaaacacacc 1440 

tggctatgcc caccttatgt ccctaaggct attacaggcc agggtttgga ctgctccggc 1500 

ccacagggct gcccagcctc cccacactga gggtcagcag cccaccagga agtcactttc 1560 

cttcaataaa ctgatggtag gaacttgtg 1589 



<210> 4 

<211> 291 

<2\2> PRT 

<213> Unknown 





<400> 


4 








Met 


Glu Arg 


Lys 


Arg 


Trp 


Glu Cys 


1 






5 






Arg 


Glu Glu 


Val 


Pro 


Arg 


Arg Ser 






20 








Val 


Phe Tyr 


Tyr 


Ser 


Pro 


Ser Gly 




35 








40 


Leu 


Ala Arg 


Tyr 


Leu 


Gly 


Gly Ser 




50 








55 


Arg 


Thr Gly 


Lys 


Met 


Leu 


Met Ser 


65 








70 





Pro 


Ala 


Leu 


Pro Gin Gly Trp Glu 




10 




15 


Gly 


Leu 


Ser 


Ala Gly His Arg Asp 


25 






30 


Lys 


Lys 


Phe 


Arg Ser Lys Pro Gin 








45 


Met 


Asp Leu 


Ser Thr Phe Asp Phe 








60 


Lys 


Met 


Asn 


Lys Ser Arg Gin Arg 






75 


80 
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Val 


Arcr 


Tyr Asp 


Ser 


Ser 


Asn 


Gin 


Val 


Lys 


Gly 


Lys Pro Asp 


Leu 


Asn 










85 










90 






95 




Thr 


Ala 


Leu 


Pro 


Val 


Arg 


Gin 


Thr 


Ala 


Ser 


He 


Phe Lys Gin 


Pro 


Val 








100 










105 






110 






Thr 


Lys 


lie 


Thr 


Asn 


His 


Pro 


Ser 


Asn 


Lys 


Val 


Lys Ser Asp 


Pro 


Gin 




.115 










120 








125 






Lys 


Ala 


Val 


Asp 


Gin 


Pro 


Arg 


Gin 


Leu 


Phe 


Trp 


Glu Lys Lys 


Leu 


Ser 


130 










135 










140 






Glv 


Leu 


Asn 


Ala 


Phe Asp 


He 


Ala 


Glu 


Glu 


Leu 


Val Lys Thr 


Met 


Asp 


145 










150 










155 






160 


LCU 


Pro 


Lys 


Gly Leu Gin Gly 


Val Gly Pro Gly 


Cys Thr Asp Glu 


Thr 










165 










170 






175 




Leu 


Leu 


Ser 


Ala 


He 


Ala 


Ser 


Ala 


Leu 


His 


Thr 


Ser Thr Met 


Pro 


He 








180 










185 






190 






Thr 




Gin 


Leu 


Ser 


Ala 


Ala 


Val 


Glu 


Lys 


Asn 


Pro Gly Val 


Trp 


Leu 






195 










200 








205 






Asn 


Thr 


Thr 


Gin 


Pro 


Leu 


Cys 


Lys 


Ala 


Phe 


Met 


Val Thr Asp 


Glu 


Asp 




210 










215 










220 






lie 


Arg 


Lys 


Gin 


Glu 


Glu 


Leu 


Val 


Gin 


Gin 


Val 


Arg Lys Arg 


Leu 


Glu 


225 










230 










235 






240 


Glu 


Ala 


Leu 


Met 


Ala 


Asp 


Met 


Leu 


Ala 


His 


Val 


Glu Glu Leu 


Ala 


Arg 










245 










250 






255 




Asp 


Gly 


Glu 


Ala 


Pro 


Leu 


Asp 


Lys 


Ala 


Cys 


Ala 


Glu Asp Asp Asp 


Glu 






260 










265 






270 






Glu 


Asp 


Glu 


Glu 


Glu 


Glu 


Glu 


Glu 


Glu 


Pro 


Asp 


Pro Asp Pro 


Glu 


Met 




275 










280 








285 







Glu His Val 
290 



<210> 
<211> 
<212> 
<213> 



5 

1966 
DNA 

Unknown 



<400 
gggggcgtgg 
agaggeggtg 
tgatgcttgc 
aagggctegg 
cccgggggga 
cggcgctggc 
cccggtgagc 
gaagcaggcg 
gggacgggga 
tggeggegae 
ggagccggtc 
ggagagcggg 
gatccgaaaa 
taagaagttc 
cagttttgac 
actgegaaac 
aattagacaa 
taataaagtg 
gaagaggcta 
actacccaaa 
tgttgccagt 
ggaaaagaac 



> 5 
ccccgagaag 
gccggggcca 
gcgcgtcccc 
ttaeggaaga 
ggccgctgct 
ggcgactccg 
ggcgtgcgca 
ggccggggcg 
eggggceggg 
ggeggegget 
cctttcccgt 
aagaggatgg 
tctgggctaa 
agaagcaagc 
ttcagaactg 
gatcctctca 
acagcatcaa 
aaatcagacc 
caaggactta 
ggtcttcaag 
getttgeaca 
cctgctgttt 



gcggagacaa 
cgccccgggc 
cgcgcgccgc 
agcgcagcgc 
gcccggagca 
ccatagagca 
gggaaggege 
gcggcgtctg 
gccggggccg 
geggeggegg 
eggggagege 
attgcccggc 
gtgctggcaa 
ctcagttggc 
gaaagatgat 
atcaaaataa 
ttttcaaaca 
cacaacgaat 
gtgeatcaga 
gagttggtcc 
caagctctgc 
ggcttaacac 



gatggccgcc 
aggagggecg 
getgegggeg 
cggctgggga 
ggaggagggg 

ggggggccag 

teggggegge 
tggccgtggc 
cggccgtccc 
eggcageggt 
ggggccgggg 
cctccccccc 
gagcgatgtc 
aaggtacctg 
gectagtaaa 
gggtaaacca 
accggtaacc 
gaatgaacag 
tgtaacagaa 
aggtagcaat 
gccaatcaca 
atctcaaccc 



catagegett 
ctctgtgcgc 
gggegggtet 
gggggctgga 
gagagtgegg 
ggcagcgcgc 
ggccgtggcc 
eggggceggg 
ccgagtggcg 
ggcggcggcg 
cccaggggac 
ggatggaaga 
tactacttca 
ggaaatactg 
ttacagaaga 
gacttgaata 
aaagtcacaa 
ccacgtcagc 
caaattataa 
gatgagaccc 
gggcaagtct 
etctgeaaag 



ggaggaccta 
gcccgctcta 
ccgggattcc 
tgcgcgcgca 
egggeggcag 
tcgccccgtc 
gggggcggtg 
gccgtggccg 
gcagcggcct 
ccccccggcg 
cccgggccac 
aggaggaagt 
gtccaagtgg 
ttgatctcag 
acaaacagag 
caacattgcc 
atcatcctag 
ttttctggga 
aaaccatgga 
ttttatctgc 
ccgctgctgt 
cttttattgt 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
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cacagatgaa 
agaagcactg 
aatggacagt 
aagrgaaaat 
aaaatgtacc 
tttttgatgt 
aattagtctt 
tgcaatctac 
agaacagatc 
tataaacata 
gtaaagacaa 



gacatcagga 
atggcagaca 
ggagatgaag 
tcctagaaat 
cgagcacata 
atatatctat 
ttgtaatatc 
tggaaatgta 
aggaattcta 
tttttatact 
ataaatgatt 



aacaggaaga 
tcttgtcgcg 
cctaagaata 
tgaacaaaaa 
gagcttttta 
tattcaaaaa 
aagcaggacc 
gcacttacgt 
aataaatttc 
ttattgaaag 
atattcacaa 



gcgagtacag 
agctgctgat 
tgatcaggta 
tgtttccact 
atagcactaa 
atcatgttta 
ctaagatgaa 
aaaacatttg 
ccagttaaag 
gggacacctg 
aaaaaaaaaa 



caagtacgca 
acagaagaga 
actttcgacc 
ggcttttgcc 
ccaatgcctt 
ttttgagtcc 
gctgagcttt 
tttcccccac 
attattgtga 
tacattcttc 
aaaaaa 



agaaattgga 
tggatattga 
gactttcccc 
tgtaagaaaa 
tttagatgta 
taggacttaa 
tgatgccagg 
agttttaata 
cttcactgta 
catcatcact 



1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1966 



<210> 6 
<211> 414 
<212> PRT 
<213> Unknown 



<400> 6 







Ala 


His 


Pro 


Glv 


Glv 


Glv 


1 








5 








\j J. y 


Glu 


Ser 


Ala 


Ala 


Glv 


Glv 


Ser 








20 










Glu 


Gin 


Glv 


Glv 


Gin 


Glv 


Ser 


Ala 






35 










40 


Vol 


ft TCI 




Glu 


Glv 


Ala 


Arcr 


Glv 




50 










55 




Lys 


Gin 


Ala 


Ala 


Jirn 


Glv 


Glv 


Glv 


65 










70 






Gly 


Arg 


Gly 


Arg 


Gly 


Arg 


Gly 


Arg 










85 








Pro 


Gin 


Ser 


Gly 


Gly 


Ser 


Gly 


Leu 








100 










Gly 


Gly 


Cys 


Gly 


Val 


Gly 


Ser 


Gly 






115 










120 


Pro 


Val 


Pro 


Phe 


Pro 


Ser 


Gly 


Ser 




130 










135 




Arg 


Ala 


Thr 


Glu 


Ser 


Gly 


Lys 


Arg 


145 










150 






Gly 


Trp 


Lys 


Lys 


Glu 


Glu 


val 


lie 










165 








Lys 


Ser 


Asp 


Val 


Tyr 


Tyr 


Phe 


Ser 








180 










Lys 


Pro 


Gin 


Leu 


Ala 


Arg 


Tyr 


Leu 






195 










200 


Phe 


Asp 


Phe 


Arg 


Thr 


Gly 


Lys 


Met 




210 










215 




Lys 


Gin 


Arg 


Leu 


Arg 


Asn 


Asp 


Pro 


225 










230 






Asp 


Leu 


Asn 


Thr 


Thr 


Leu 


Pro 


He 










245 








Gin 


Pro 


Val 


Thr 


Lys 


Phe 


Thr 


Asn 








260 










Asp 


Pro 


Gin 


Arg 


Met 


Asn 


Glu 


Gin 






275 










280 


Arg 


Leu 


Gin 


Gly 


Leu 


Ser 


Ala 


Ser 




290 










2 95 





TV yn 


10 


v-y a 


Pro 


Glu 


Gin 


Glu 
15 


Glu 


Glv 


Ala 


Glv 


Glv 


Asp 


Ser 


Ala 


He 


25 










30 






Leu 


Ala 


Pro 


Ser 


Pro 


Val 


Ser Gly 










45 








Glv 


Glv 


Arcr 


Glv 


Arg Gly Arg 


Trp 








60 










Val 


Cys 


Glv 


Atct 

rti y 


Gly Arg 


Gly Arg 






75 










80 


Gly 


Arg 


Gly 


Arg 


Gly Arg 


Gly Arg 




90 










95 




Gly 


Gly 


Asp 


Gly 


Gly Gly 


Gly Ala 


105 










110 






Gly 


Gly 


val 


Ala 


Pro 


Arg 


Arg Asp 










125 








Ser 


Gly 


Pro 


Gly 


Pro Arg 


Gly 


Pro 








140 










Met 


Asp 


Cys 
155 


Pro 


Ala 


Leu 


Pro 


Pro 
160 


Arg 


Lys 
170 


Ser 


Gly 


Leu 


Ser 


Ala 
175 


Gly 


Pro 


Ser 


Gly 


Lys 


Lys 


Phe 


Arg 


Ser 


185 










190 






Gly 


Asn 


Ala 


Val 


Asp 
205 


Leu 


Ser 


Ser 


Met 


Pro 


Ser 


Lys 
220 


Leu 


Gin 


Lys 


Asn 


Leu 


Asn 


Gin 
235 


Asn 


Lys 


Gly 


Lys 


Pro 
240 


Arg 


Gin 
250 


Thr 


Ala 


Ser 


He 


Phe 
255 


Lys 


His 


Pro 


Ser 


Asn 


Lys 


Val 


Lys 


Ser 


265 










270 






Pro 


Arg 


Gin 


Leu 


Phe 
285 


Trp 


Glu 


Lys 


Asp 


Val 


Thr 


Glu 
300 


Gin 


He 


He 


Lys 
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Thr Met Glu Leu Pro Lys Gly Leu 
305 310 
Asp Glu Thr Leu Leu Ser Ala Val 
325 

Ala Pro He Thr Gly Gin Val Ser 
340 

Val Trp Leu Asn Thr Ser Gin Pro 
355 360 
Asp Glu Asp He Arg Lys Gin Glu 

370 375 
Lys Leu Glu Glu Ala Leu Met Ala 
385 390 
Thr Glu Glu Val Asp He Asp Met 
405 



Gin Gly Val 


Gly Pro Gly 


Ser 


Asn 




315 










320 


Ala 


Ser Ala 
330 


Leu 


His 


Thr 


Ser 
335 


Ser 


Ala 


Ala Val 


Glu 


Lys 


Asn 


Pro 


Ala 


345 








350 






Leu 


Cys Lys 


Ala 


Phe 
365 


He 


Val 


Thr 


Glu 


Arg Val 


Gin 
380 


Gin 


Val 


Arg 


Lys 


Asp 


He Leu 
395 


Ser 


Arg 


Ala 


Ala 


Asp 
400 


Asp 


Ser Gly 
410 


Asp 


Glu 


Ala 







<210> 7 
<211> 2392 
<212> DNA 
<213> Unknown 



<400> 7 

agcgggccga ggagccgggc gcaatggagc ggaagaggtg ggagtgcccg gcgctcccgc 60 

agggctggga gagggaagaa gtgcccagaa ggtcggggct gtcggccggc cacagggatg 12 0 

tcttttacta tagcccgagc gggaagaagt tccgcagcaa gccgcagctg gcgcgctacc 180 

tgggcggctc catggacctg agcaccttcg acttccgcac gggcaagatg ctgatgagca 240 

agatgaacaa gagccgccag cgcgtgcgct acgactcctc caaccaggtc aagggcaagc 3 00 

ccgacctgaa cacggcgctg cccgtgcgcc agacggcgtc catcttcaag cagccggtga 360 

ccaagattac caaccacccc agcaacaagg tcaagagcga cccgcagaag gcggtggacc 420 

agccgcgcca gctcttctgg gagaagaagc tgagcggcct gaacgccttc gacattgctg 480. 

aggagctggt caagaccatg gacctcccca agggcctgca gggggtggga cctggctgca 540 

cggatgagac gctgctgtcg gccatcgcca gcgccctgca cactagcacc atgcccatca 600 

cgggacagct ctcggccgcc gtggagaaga accccggcgt atggctcaac accacgcagc 660 

ccctgtgcaa agccttcatg gtgaccgacg aggacatcag gaagcaggaa gagctggtgc 72 0 

agcaggtgcg gaagcggctg gaggaggcgc tgatggccga catgctggcg cacgtggagg 780 

agctggcccg tgacggggag gcgccgctgg acaaggcctg cgctgaggac gacgacgagg 84 0 

aagacgagga ggaggaggag gaggagcccg acccggaccc ggagatggag cacgtctagg 900 

gcagaggccc tgccgagagc ccgtgctgcc tgctggagcc gcctgcagac gcggtcctcg 960 

gccccacgtg aaccaggctc ggcggcgaag cccagccttg gagacaccca ggaggaaggc 102 0 

cgtgctcctg gctccctcct cggcccgtcc ccacttcccg gggcctcggg gcacacagct 1080 

ggggctgccc ccacccgaaa gaccctccac gctcgtcctc tacagagtcc ggcttcggga 1140 

agtgccgggt gctcctgggc cctgcctggc tccctacgac ctttgggctc gaggccagct 1200 

cctccccatg cccgctgtcc cagctccttg agactggaga gcagccagca ggtgcccggc 1260 

agctcggcgc cacggcttgc tgacagctgg gagggtttct cggtctggag gcgtagtttt 1320 

gaaactcaca tcacccactg tgcagcgtga ggacgggact ctggtctgct gtggggggca 1380 

tgcaggacgg cgccactctc tgccctgcca tgcggctggt ggtgccacag agcctcaccg 1440 

tgcctgagtg gcgtgcccag ggaggccgct ctccttcagt aaatgtaaca cagtcgaggc 1500 

acgtcatcgg gcagccttcc ctgtgtgcca acgccagcct tcgcttctga aaaccaaact 1560 

ccagccgctg ccagtcggga cttggtcgcc cggcgctgcc agaatgctcc actgccagcc 162 0 

ggcccccctg cctcggtttc ccttctgttt agtggcgaca caggcaccca gctttggggt 1680 

ggtgctgacg ctcccagggg tgccaggagc cactgggaca gggtgaggct cccagacgct 174 0 

cctcgaggtg cccagctctc cagggagctt ctggcccaag gcgttcttga gggatctgct 1800 

ccttaacccc ccagtgcctt ggcgagggca ggttccaagc cacagacgcc tgccccgagt 1860 

ggactttgcg gccagtccct gggtgccttc ctgggccctg cttgcccagt gagggttcct 192 0 

aacgggtggg ttcawtggcc tggcccvagc gagcccccac ctgcattgac cttaggccca 198 0 

tagagagggc ctgtcccggt gctgccccag ccaaggatct ggtcgctgcc ccagggggac 2 04 0 

tgatgggcaa gagtcgcccc tgtggctgga ctgtgaccat ccctgatggg gcctgaccgc 2100 

gggagctgag gaagcgccgc tccaccgtct gccctccaag gacccgcatg gaggcagtgg 216 0 
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